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PREFACE 


The contents of this book are selected from material which I have presented 
in numerous courses on the calculus of variations at the University of Chicago. 
They are a selection of related portions rather than a complete account. 

The purposes of the book are two: first, the presentation of the notions and 
principles underlying modern theories of the calculus of variations in as simple 
and yet as inclusive a form as possible; and, second, the development, as com- 
pletely as is now possible, of the theory of one of the most comprehensive prob- 
lems of the calculus of variations which have so far been formulated. For these 
purposes the book is divided into two parts — Part I, entitled “Simpler Problems 
of the Calculus of Variations,” and Part II, on “The Problem of Bolza.” 

Part I consists of six chapters which deal with relatively elementary problems 
of the calculus of variations. In mimeographed form these have often been 
used as a text for beginning courses in the subject. In the first three chapters 
I have studied the fundamental problem of the calculus of variations in three- 
dimensional space, the problem with fixed end-points. I have chosen to begin 
with the problem in three dimensions because its results are extensible, with 
little more than the formal notational changes shown in Chapters IV and V, 
to the corresponding problems in the plane and in higher dimensional spaces 
and to problems in parametric form. Such extensions cannot be made so con- 
veniently if one starts with the simplest problem in the plane and tries to 
generalize from there. Chapter VI is an introduction to the theory of problems 
with variable end-points — problems which in the past have received much less 
attention than those for which the end-points are fixed. 

When the book is used as a text for a course, illustrative examples will alwa3rs 
be found valuable. I have listed very few in the text, and there are not many 
whose details have been worked out with any completeness in the light of 
modem theory. As references, one could consult Bliss, CalctUus of Variations^ 
where several problems in the plane have been discussed in more than usual 
detail, and Bolza, Vorlesungen iiber V ariationsrechnung, which has a rich list. 
In Bolza ’s book the problems are collectively enumerated in the Index, but for 
a particular problem the complete discussion is usually scattered, in parts, 
through the text. Older books, such as Jellet, An Elementary Treatise on the 
Calculus of Variations^ list many problems, but usually with few details. It 
would be a great service to students of the calculus of variations to have a list 
of the problems of the calculus of variations which have been exhaustively 
studied in the light of modem theories and to have such extensions of the list 
as might be made by an experienced student. 

Oskar Bolza was professor of mathematics at the University of Chicago for 
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nearly eighteen years. Most of his research during that period was in the 
calculus of variations. In 1913 he formulated a very general problem in the 
subject which has frequently been referred to as ‘‘the problem of Bolza.” It 
includes as special cases all but a veiy few of the hitherto formulated problems 
of the calculus of variations involving only so-called “simple” integrals, as 
contrasted with multiple integrals. Little was done toward the systematic 
study of the problem of Bolza until about 1930. Since that time it has been the 
subject of many memoirs, whose results have made it possible to present here, 
for the first time in book form, a relatively complete treatment. With the re- 
sults of the study here made, the theory of the most general problems having 
variable end-points is brought to a stage of advancement comparable to that 
achieved up to this time for problems whose end-points are fixed. The Bibliog- 
raphy, at the end of the book, is evidence of the great interest which the problem 
of Bolza has aroused among mathematicians in the past and which it will 
doubtless continue to arouse in the future. 

Throughout the theory of the calculus of variations applications of the 
existence theorems for implicit functions and differential equations are fre- 
quent and fundamental. These theorems are now well known, but in the litera- 
ture they are scattered and not in convenient form. I have therefore added an 
Appendix at the end of the book, in which I have endeavored to simplify proofs 
and record their results in theorems, which, it is hoped, will be especially 
convenient and useful for students of the calculus of variations. 

The Bibliography of the Problem of Bolza (pp. 287-91) includes the names 
of numerous mathematicians who in recent years have been interested in the 
theory. I am indebted to many of them for details presented in this book, but 
especially to H. H. Goldstine, L. M. Graves, M. R. Hestenes, E. J. McShane, 
Marston Morse, and W. T. Reid. Both Hestenes and Goldstine, during periods 
in which they were my research assistants, contributed most valuable assistance 
in the development of the theory and in the preparation of the manuscript of 
this book. 

Gilbert A. Bliss 

University of Chicago 
April 1945 
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PART I 

SIMPLER PROBLEMS OF THE CALCULUS OF VARIATIONS 




CHAPTER I 

THE CALCULUS OF VARIATIONS IN THREE-SPACE 


1. The nature of problems of the calculus of variations. A fundamental prob- 
lem of the differential calculus is that of finding in the range of values of an inde- 
pendent variable x one for which a given function f{x) is a maximum or a mini- 
mum. The calculus of variations is also a theory of maxima and minima, but 
for variables and functions which are more complicated than those of the differ- 
ential calculus. A relatively simple illustration is afforded by the problem of 
finding, in a class of arcs joining two given points in rcyz-space, one which has the 
shortest length. The independent variable in this case may be designated by 
the symbol C, and its range is the class of arcs joining the two given points. 
The function to be minimized is the function /(C), whose value for each arc C 
is the length of that arc. 

For the present it will be supposed that the arcs of the class in which a mini- 
mum is sought are all in icyz-space and defined by functions of the form 

(1-1) 2(:r) (xi^xS^Xz) 

which have continuous derivatives. For the problem of the last paragraph the 
function to be minimized is then the length integral 

7(0 = P[H-y'2 + 2/2]i/2^^^ 

whose value is determined when the derivatives y'(x), z'{x) of the functions 
y{x)yz{x) defining Care substituted in its integrand. Analytically the problem is 
that of finding, in a class of pairs of functions y{x)y z{x) having given values 

yi = y (^i) , zi = 2 ixi) , 

y2 = y (^ 2 ) , Z2 = 2 iX2) 

at xi and X 2 y one which gives the integral /(C) its minimum value. 

A second example from the calculus of variations, which may also be cited 
here as an illustration, is the so-called “brachistochrone problem,” originally 
discussed by Galileo.^ If an initial velocity Vi and two fixed points, 1 and 2, in 
space are given, then the time of descent of a heavy particle starting at 1 with 
the velocity vi and falling, under the action of gravity alone, along an arc C to 
the point 2 is uniquely determined by the form of the arc C. The problem in 
question is that of finding, in a class of arcs C joining the points 1 and 2, one 

^ See Galileo, Dialog iiber die beiden hauptsdchlichsten Wdtsysteme (1630), trans. Strauss, 
pp. 471-72; and his Dialogues concerning Two New Sciences (1638), trans. Crew and De 
Salvio, p. 239. 
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down which the particle will fall in the shortest time. The solutions of this 
famous old problem by the Swiss mathematicians John and James Bernoulli* 
gave rise to the first systematic theory of the calculus of variations. The name 
of the problem is derived from the Greek words jSpax^crros, meaning “shortest/’ 
and xpovosy meaning “time.” 

An analytic expression for the time of descent can readily be found when a 
system of axes is taken as shown in Figure 1*1. Let m be the mass of the falling 
particle P, and let s be the length of the arc C from 1 to the position of P at the 



time /. Then the force on P in the direction of the tangent to C has the expres- 
sions 

cos e = mg 

where g is the well-known gravitational constant. When this equation is multi- 
plied by 2ds/dt and integrated, it is found that 

(1-3) (^y=2g2 + c = 2g(2-a) 


where a = —cl2g and, in terms of the initial coordinate Zi and initial velocity Vi 
at the point /, the constant a has the value 


found by setting / = /i in equation (1 *3). By integrating equation (1*3) again, 
the time of descent is found to be l/(2g)^/* times the value of the integral 


(1*4) 



z — a) s 



1+y'^+z 
z~“ a 


/2 



If /(C) represents the value of the integral in the last expression taken along the 
arc C, the problem can be stated as that of finding, in a suitably specified class 
* For these solutions see Ostwald, Klassiker der exakten Wissensckaflen, No. 46. 
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of arcs C joining the points I and 2, one which minimizes the integral 1(C). 
This is a problem similar to that of finding the shortest arc, but with a different 
integral function 1(C) to be minimized. 

The two problems which have been formulated above are special cases of the 
more general problem of finding, in a class of arcs 

(1*5) 3'W, « W (xi^x^X2) 

joining two fixed points I and 2 in acyz-space, one which minimizes an integral 
of the form 

(1-6) 7(C) = n/(x, y, 2 , y', z') dx . 

The value of the integral along an arc C is to be found by substituting y(x), 
z(x) and their derivatives y'(:r), z\x) in place of the variables y, 2 , y', 2 ' so 
that the integrand becomes a function of x alone. This is the problem which 
will be studied in the present chapter. For the special case of determining 
the shortest arc joining two points the integrand function / has the value 
/ = [1 + y'^ + 2 ' 2 ]‘/^; and for the brachistochrone problem,/ = [(1 + y'®+ 2 '*)/ 
(2 — There are, of course, many other special cases of interest included 
in the general problem formulated above. 

It should be remarked here that there are other theories of the calculus of 
variations which apply to problems of many types that are different from those 
just described. For example, one may restrict the class of arcs in which a mini- 
mum for the brachistochrone problem is sought to be those which join two 
given points and have a given length. Analytically the problem is that of 
finding, in the class of arcs (1 • 5) joining two given points 1 and 2 and giving the 
length integral 

/= /'*’[! +y'2+2'2]i/2rf;c 

a fixed va.lue, one which minimizes the integral 1 of equation (1-4). Such prob- 
lems, in which one integral 1 is to be minimized in a class of arcs on which a 
second integral / has a fixed value, are called “isoperimetric problems.” Other 
problems involve multiple integrals. If one seeks a surface of minimum area 
in a class of surfaces bounded by a fixed simply closed curve C in .ryz-space, the 
integral to be minimized may be taken in the form 

^ =// [ ^ +(ll) +(l^) ] ^ = 

and the surfaces over which it is to be evaluated, in the form 2 — z(x, y). Some 
of these problems will be considered in more detail in later chapters. 

The problem in three-space associated with the integral (1-6) has been select- 
ed for preliminary study because its theory illustrates well the methods which 
are applicable in more complicated cases and because many of the important 
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theorems for the problems which will be studied in later chapters can be deduced 
with very little effort when the theory of the problem in three-space has been 
mastered. 

2. The origin of the name ‘‘calculus of variations.” The name of the theory 
which is to be studied here, the “calculus of variations,” was adopted as a 
result of notations which were introduced by Lagrange about the year 1760. 
In order to compare the value of an integral 1(C) along an arc C with its value 
along a neighboring arc, he altered the functions y(x)y z(x) defining C, by 
the addition of increments Sy(x), Bz(x) which are also functions of x and which 
are called variations of the functions y(x)^ z(x). If these increment functions 
vanish at Xi and x^^ the curve defined by the functions 

y(x) + By(x), z(x) + 6z(x) (xi^x^X 2 ) 

will still pass through the end-points of C. For convenience, this new curve may 
be denoted by the symbol C + 5C. The problem to be solved is, then, that of 
finding, in a given class of arcs joining the end-points of C, one such that the 
inequality 

A/ = /(C+5C)-/(C)^0 

holds for every pair of variations 6y, 8z defining an arc C +6C also in the class. 
In attacking this problem writers on the calculus of variations have often used an 
expansion of the difference 

A/= /" \f(x, y+dy, z+6z, y'+5y', z'+5z') -J(x, y, z, y', z') ]dx 
•'*1 

in the form 

A/ «»/+... . 

Here 67, 6*7, . . . represent integrals of homogeneous polynomials of the first, 
second, and higher orders in 6y, 6z and their derivatives 6y', 6z' with respect to 
x^ found by expanding the integrand of 7 into a Taylor series. The expressions 
67, 6*7, ... are called the first, second, and higher variations of the integral 7. 
These are the notations of Lagrange from which the theory of maxima and 
minima of functions such as 7(C) came to be called the “calculus of variations.” 
At the present time many of the results of the theory can be obtained more 
readily without extensive use of the 6 notations. 

There are analogies between the variations 6y, 6z of the functions y(a;), z(x) 
and the differential dx of an independent variable x of the differential calculus, 
and between the variations 67, 6*7, . . . and the differentials df, d*/, ... of a 
function f(x). In the years between 1800 and 1850 these analogies were much 
studied,* and the literature of that period indicates a strong desire to unify 

* See, e.g., Strauch, Theorie und Anwendung des sogenannten VariationscalcuVst Vols. I 
and II (1854), and many other writers. 
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the methods of the differential calculus and the calculus of variations. Analysis 
of the type elaborately developed at that time seems, however, to have been 
relatively unproductive of important results. Recently new and much more 
significant attempts have been made to identify the theories of maxima and 
minima in the calculus and the calculus of variations as special instances in 
more inclusive theories of general analysis or functional calculus, but these 
developments are still in many respects in a formative stage.* 

3. Analytic formulation of the problem. Suppose that a totality of arcs C has 
been specified along each of which the integral /(C) of equation (1-6) has a 
well-determined value. The arcs of this totality will be called (i^issible^arcs. 
A problem of the calculus of variations associated with such admissible arcs 
and such an integral is then that of finding, in the class of admissible arcs 
joining two fixed points, one which gives the integral /(C) its smallest value. 
The problem so formulated is said to have fixed end-points. It may be modified 
by specifying the class of arcs in which a minimum is sought to be the class of 
admissible arcs joining a fixed point and a fixed curve, or a fixed curve and a 
fixed surface; and other possible modifications of a similar sort are evident. In 
these latter cases the problem is said to have variable end-points. 

The totality of admissible arcs may be defined in many ways when an inte- 
gral I is given, and to each of these different ways there corresponds a different 
problem of the calculus of variations. For present purposes it will be supposed, 
first of all, that there is a region R of sets of real values y, z, y', z') in which 
the integrand function /(jr, y, z, y', z') has continuous derivatives up to and in- 
cluding those of the fourth order. It may have derivatives of higher orders, but 
the ones just specified are all that are needed for the theory which is to follow.® 
A set {x, y, z, y', z') interior to the region R is called an admissible set. An arc C 
is customarily called regular if the fu nctions y {x), z{ x\ defining it arc single - 
valued an d have continuous derivativ^on the interyal The totality of 
admissible arcs to be considered here is then the totality of continuous arcs 
each of which consists of a finite number of regular sub-arcs whose elements 

♦There is an extensive literature on “functions of lines’* by Volterra. In addition, one 
should note especially the following papers, some of which contain references to other impor- 
tant papers in this field: Hahn, “Ueber die Lagrangeschen Multiplikatorenmethode,** Wiener 
BerkhtCy XIII (1922), 531-50; Sanger, “Functions of Lines and the Calculus of Variations” 
(diss.. University of Chicago), Contributions to the Calculus of Variations 193 1-1932 , Depart- 
ment of Mathematics of the University of Chkago, pp. 191-293 (this title will hereafter be des- 
ignated as Contributions); Goldstine, “Conditions for a Minimum of a Functional” (diss.. 
University of Chicago), Contributions 1933-1937, pp. 315-57; Menger, “Metric Methods in 
the Calculus of Variations,** Proceedings of the National Academy of Sciences, XXIII (1937), 
244-50; Goldstine, “A Multiplier Rule in Abstract Spaces,** Bidletin of the Amerkan Mathe- 
matkal Sockty, XLIV (1938), 388-94. 

•Third derivatives of / are, in fact, sufficient for most of the theory if the imbedding 
theorems of Section 7 are established by means of the canonical equations of the extremals 
as in Section 27 
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(x, y, 2, y\ 2 ') are all admissible. SiKh a n admis sible arc jimy have ^ finUejium- 
berrf cornra, as indicated in Figure 3 1, at which the integrand function 
z{x), y'(»), z\x)] will be discontinous, but the integral of this function 
over the interval X 1 X 2 is, nevertheless, well defined. 

As modifications of the definition of admissible arcs just given, one might 
require them to be regular, or analytic, or perhaps only rectifiable. The defi- 
nition given above, however, seems to lead to the simplest introductory theory, 
and in many geometrical and mechanical problems it is the natural one. 

The specification of the region R of sets (jc, y, z, y', z') is not an artificial one 
for purposes of analysis only. Each special problem has a largest region of this 
sort naturally associated with its integrand function f(Xy y, z, y\ z'). For the 
shortest-distance problem with the integrand function (1 + y'^ + the 
largest region R consists of all sets (x, y, z, y', z') with real finite values for the 



the five coordinates; but for the brachistochrone problem only those sets are 
admissible for which z > a, as is evident from the form of the integrand in the 
expression (1*4). This does not affect the generality of the theory of the brachis- 
tochrone problem; for equation (1*3) shows that, if a particle P starts at the 
point 1 of an arc C with the velocity its velocity will be zero when it reaches 
the altitude of the plane z = a on the curve, if the curve reaches that altitude, 
and that it can never rise above that plane. It is only necessary, therefore, to 
consider arcs which are below the plane z = a, or, as an exceptional case, those 
which go up to this plane but not above it. Similarly the integrand of the prob- 
lem of the calculus of variations which corresponds to the special relativity 
theory in the a;y-plane is 

b'-m'-mr- 

Analytically this problem is set in the /jcy-space; and the admissible elements 
(/, Xf y, x'y y') are those for which the variables /, x, y have arbitrary finite 
values but for which x' and y' are restricted by the inequality c* > x'^ + y'*, 
where it is understood that x' = dx/dt, y' = dy/dL 

For the examples just cited the region R described in each case was the 
largest in which the integrand function /{x, y, z, y', z') for the particular problem 
in question had the continuity properties required for the theory. There is no 
reason, however, why R should not be selected as a subregion of the largest one 
possible. This is frequently done when the minimizing properties of a particular 
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arc E with respect to neighboring arcs are to be studied. The region R in that 
j case may be selected as a certain neighborhood of the values (a;, y, z, y\ z') be- 
l longing to £, and an arc E which furnishes a maximim or minimum in such a 
neighborhood is said to furnish a relative maximum or minimum. In the follow- 
ing pages theorems concerning such relative extremes are deduced; but, unless 
otherwise specified, it is always understood that the region R is an arbitrarily 
selected region in which the integrand / has the desired continuity properties. 

4. The first and second variations. Consider, now, a particular admissible 
arc E, joining the points 1 and 2 and defined by functions 

y{x) ,z{x) {xi^x^xi) 

whose minimizing properties are to be tested. If a is an arbitrary constant, and 
if fW are two functions vanishing at xi and and having continuity 
properties similar to those of yix), z(jc), then every arc of the one-parameter 
family 

(41) y(x) + ari(x), zix) + at(x) (xi^x^Xi) 

passes through the end-points I and 2 of E, and the family contains E for the 
particular parameter value a = 0. Furthermore, the arcs of the family, for 
sufficiently small values of a, are all admissible, since the elements {x, y, z, y', z') 
of E are all interior to the region R; and the corresponding elements of the 
arc (4 1) will therefore also be interior to R when a is small. In the notation of 
Lagrange the equations of the arcs of the family (4- 1) are obtained from those 
\ of E by adding variations of the special forms 5y = ari{x), 5z = a^(x) to the 
functions y(x), z(x) defining E. 

When the functions (4 1) are substituted in the integrand of the integral /, 
a function of the parameter a of the form 

7 (a) = y + arj, z + af, y' + a??', z' + af')tf^ 

is obtained. It is understood that in the integrand y + arj stands for y{x) + 
art^x), and similarly for the other arguments. If the arc E furnishes a minimum 
\ value for the integral 7, then evidently 7(a) must have a minimum at a = 0, 
^iand the conditions 7'(0) = 0, 7"(0) ^ 0 must be satisfied. The values of these 
derivatives are readily calculated to be 

/'(O) = n/y^ + Af + /v V + 

7"(0) = n2u>{x, 7], r, r/' ndx, 
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where 2of is the quadratic form in rj, 17', t' whose matrix of coefficients is 



fv 

fm' 

/»»' 

u 

f.. 

fn' 

/«' 

/»'» 



fy’.‘ 






In these expressions the subscripts of / indicate partial derivatives, and the 
arguments of these derivatives are x and the functions y(x), z(x), y'(x)y z'(x) 
belonging to the arc E. 

The expressions /'(O), /"(O) are called the first and second variations of the 
integral I along the arc E. This nomenclature is not quite in agreement with 
that described in a preceding paragraph, but it is somewhat more convenient. 
The first and second variations, according to the earlier definition, would be 
and a*7"(0). Similarly, f(:r) will be called variations of y(:r), z{x) 
instead of hy = aiy, hz = a^\ and they will be designated as admissible varia- 
tions when they have the continuity properties of y(x) and z{x). When it is 
desired to indicate the dependence of the two variations of I upon the functions 
rijt, they will be denoted by /i(i;, f), /2(??, f), respectively. 

'il first necessary condition th at 1(E) be a minim um isy then, that the first and 
seconSTo^cU^^ E shall satisfy the conditions 

/i(i7,f) = 0, h (ty, r) ^ 0 

for all admissible variations i?(x), f (x) vanishing at xi and X2. For a maximum the 
last inequality is to be changed in sense. 

5. The fundamental lemma. In order to evaluate further the condition 
f) = 0, which has just been deduced, the following auxiliary theorem 
will be of service: 

Fundamental Lemma. Let M(x) be a bounded function on the interval X1X2, 
^ingk^lued and(sontinuous except possibly at a finite number of points at which 
it iSy however y supposed to have unique right- and left-hand limits^Then a necessary 
and sufficient condition that the integral 

Hm {x) {x) dx 


shall have the value zero for every admissible variation nix) vanishing at ax and X2. 
is that M(x) he a constant. 

The sufficiency of the condition can be easily verified. To prove the necessity, 
consider the particular admissible variation 


(S-1) 
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for which the constant c has been so determined that == o. One readily 
sees that the integral * 

(S-2) Aj/ (*) - c] ij' (*) d* 

must also have the value zero for every admissible variation T;(ac) vanishing 
at Xi and » 2 , if the integral of the theorem has this property. But the last integral 
has the value 

r\M(x)-c]^dx 

for the particular variation (5 1), and this value can vanish only if Af(x) = c 
at every point at which M(x) is defined. 

6. Necessary conditions from the first variation. For the particular case 
when f (jc) = 0, the first variation Ii(% f) has the form 

\fu‘n + 

and since ri{x) vanishes at Xi and x% and 

J* fydx = rify 4* J* fydx f 

except at comers of E or rj(x)j it follows by an integration by parts that 
Ix (v,0) = V' M dx . 

But 0) must have the value zero for every admissible variation ri{x) 
vanishing at xi and and hence the lemma of the preceding section shows that 
the expression in the square bracket must be constant. A similar argument 
applied to /i(0, f) justifies the following theorem: 

I. The First Necessary Condition. An admissible arc E is said to satisfy 
the condition I if there exist two constants c and d such that the equations 

(61) fy’^TfydX + C, f.’^^r/.dx+d 

•'*1 •'*1 

are identities dong £. Every admissible arc E which gives the integrd I a minimum 
or maximum value must satisfy the condition /. 

It is understood, of course, that in equations (6 1) the arguments of the 
derivatives of /are- the functions Xy y{x), 2 ( 05 ), y\x)y z\x) belonging to £. 

The equations of the theorem have three important consequences, the first 
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of which is the pair of differential equations (first discovered by Euler® in 
1744) which are described in the following corollary: 

Corollary 6 1. J ^ler^s Equations. On every sisb-arc between corners of 
an admissible arc E which satisfies the condition I the functions fy, U» have deriva- 
tives and the equations 

are satisfied. 

This is evident because the second member of the first equation (6 1), for 
example, has the derivative fy except*^t comers. At a corner it has forward 
and backward derivatives which are the values of fy calculated for the coordi- 
i nates af, y, z at the comer, and for the derivatives y', z' belonging to the sub- 
arcs of E beginning and ending at the comer, respectively. 

The expressions in the second members of equations (6 1) are continuous 
functions of x, and these equations therefore show further that at a comer of 
E the two values of fy' calcula ted f or the sub-arcs of E meeting at the com er 
must be the same . This gives a very simple justification of the following corner 
condition on a minimizing arc which was discovered independently by Weier- 
strass and Erdmann.^ 

Corollary 6*2. The Weierstrass-Erdmann Corner Condition. At 
each value x defining a corner of an admissible arc E that satisfies the condition /, 
the right and left limits of the functions fy^ and f,' are equal. This property is 
sometimes expressed by the equations 

fy z, y' U - 0) , z' (oc - 0) ] 

^ = /y' [x, y , Z, y' (oP + 0) , z' (Af + 0) ] , 

/*' [x, y» z, y' - 0) , z' (x - 0) ] 

= fz [Xy y, 2, y' (x + O) , z' (:r + 0) ] 

in which y'(x + 0), for example, is a symbol for the right-hand limit of y'(x) at 
the value x. 

It happens for many problems that this condition forbids the possession 
f^of corners by a minimizing arc. This is true for the brachistochrone and shortest- 
\ distance problems mentioned above and for all problems with integrands of the 
I form 

f^^(x,y, z) [l+y' 2 +z' 2 ]i /2 

with ^ 0, as one readily verifies. Cases are well known, however, for which 

mmimizing arcs do have comers. 

Hitherto, no assumptions concerning continuity properties of the arc E 
have been made except that it is continuous and consists of a finite number of 

* For a translation of Euler’s memoir see Ostwald, op, cit,, esp. p. 54. 

^ See, e.g., Bolza, VorUsungen abet Variationsrechnung, p. 366; also Weierstrass, Werke, 
VII (1927), no. 
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regular sub-arcs. Hilbert, however, has shown that under certain circumstances 
these sub-arcs must have continuous derivatives of orders higher than the first, 
as indicated in the following corollary: 

Corollary 6-3. Hilbert’s Differentiability Condition.* Let E be an 
admissible arc satisfying the condition I. Then near every element (x, y, z, y', z') 
of E which is not at a corner, and at which the determinant 


1 (6-3) 


/»'»' fy't 

ft'y fit 


is different from zero, the functions y(x), z(x) defining E have continuous nth 
derivatives when the integrand function f has all partial derivatives of orders ^ n 
continuous near (x, y, z, y', z'). 

The proof of this statement can be made quite simply by means of well- 
established properties of implicit functions. The equations 
fy' [*, yix) ,z (a:) , u, v] 



fy l^, y (a:) , 


2 (x) , y' (x) , z' (x) ] dx — c = 0 , 


f,' I*, yix) , z M , u, j»] 

- f y (*) . 2 (*) . y' (*) , z' (a:) 1 d* - tf = 0 

in the variables x, «, formed for the functions y{x)y z{x) defining £, are a pair 
of equations which may be represented by the notations 

F (oc, ii) = 0 , G {x, w, u) = 0 , 

and to which the theorems concerning implicit functions are applicable.® These 
theorems indicate that, near a particular solution x ^ «, v of these equations at which 
the functional determinant fuGo — FvGu is different from zero, the solutions of 
the equations form two functions u(x), v(x)j which have continuous derivatives 
with respect to x of as many orders as are possessed by the functions F and G 
themselves with respect to the variables x, w, v. But the equations above have 
the solution u{x) = y\x), v{x) = as is evident from the fact that E 
satisfies the condition I; and the functional determinant FuGv — FvGu of the 
equations is the determinant (6 -3) of Corollary 6 -3. Furthermore, the func- 
tions F and G have continuous first derivatives in x, u, v near each solution 
(jp, w, v) = [Xf y{x)y z'{x)] not corresponding to a corner of E, when the func- 
tion / has continuous second derivatives and y{x), z{x) have continuous first 
derivatives. Hence, near a point of E at which the determinant (6*3) is differ- 

* See, e.g.f Bolza, ofi. ctf., p. 30. 

• See, e.g., Bliss, ^Tundamcntal Existence Theorems/* Princeton Colloquium Lectures on 
Mathematics (New York, 1913), pp. 7-12, and “A New Proof of the Existence Theorem for Im- 
plicit Functions,** Bulletin of the American Mathematical Society^ XVIII (1912), 175-79. See 
also the Appendix of this book. 
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ent from zero the solution u(x) = y'(x)f v(x) = z'(x) has at least continuous 
first derivatives, and the functions y(x), z(x) have at least continuous second 
derivatives. One can prove more than this by making use of the third deriva- 
tives of /. For, if the functions y(x), z(x) are known to have continuous second 
derivatives, the first members of the equations above will also have continuous 
second derivatives, when / has continuous third derivatives, and hence the 
solution u{x) = y'(x), v(x) = z'(x) will have continucms second derivatives. 
By successive steps one can prove in this manner that,(when / has continuous 
nth derivatives, the functions y(x), z(x) defining a minimizing arc must also 
have continuous nth derivatives near qyery point, not a comer, at which the 
determinant (6 • 3) is different from zero J 

The proofs of the two preceding paragraphs do not apply at a comer of E. 
If the determinant (6 *3) is different from zero at the values x, y, z, y'(x -f- 0), 
z'(x + 0) belonging to a comer, it will be different from zero on a sufficiently 

short arc following the comer. On that arc the functions y'(x), z'(x) belonging 

to E will have continuous derivatives y", z" satisfying the equations 

/i/'x + fp'vy' + /y'* 2' + /yvy" + /y'*' 2" = /y » 

+ A'yy' + 2' + /sV y" + Av 2" = A , 

found by differentiating the equations (6 1) with respect to x. These equa- 
tions, solved for y" and z", show that y" and z" approach unique limits at the 
[corner, since y, z, y', z' have that property. But the well-known equation 

^g'ia + 0(x-a)] (0<e<l) 

X — a 

shows that a function g{x) having a derivative g\x) on an interval a < x ^ 
a + h will also have a forward derivative at x = a if g\x) has a unique limit 
at » = a. Hence y\x) and z'(x) have right-hand derivatives y" and z" at the 
comer. Similar arguments can be made for arcs with second end-points at a 
comer and for the higher derivatives considered in the last paragraph. 

There is a third equation similar to equations (6*1), and deducible by similar 
methods, which is at times useful and yet not often seen in the literature. If we 
write the equations of the minimizing arc in the parametric form 

(6-4) x^t, y^yit), z=z(/) {xi^t^x^), 

then this arc must minimize the integral 1 in the class of all parametric arcs 

with suitable continuity properties, joining the points 1 and 2 and having 
f '(0 > 0 on the interval hh- Every such arc is, in fact, representable in the form 
(1 * 1) originally specified. For sudi an arc the integral I has the value 
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where the primes on f, ij, f denote derivatives with respect to t. This is an 
integral similar to our original one but containing the variables t, ij, i instead 
of X, y, s. By reasonmg quite similar to that used above we infer that three 
equations similar to equations (6 ■ 1) must now be satisfied along the minimiwng 
arc.^pnly the first one gives new results, as one readily verifies, and it has the 
form 

Along the minimizing arc (6 *4) this takes the form indicated in the following 
theorem: 

Theorem 6 • 1. For every admissible arc E which gives the integral I a maximum 
or a minimum there must exist a constant b such that the equation 

(6- S) / - y'/y' - 2'/.' = r^dx + b 

is an identity along E. On every sub-arc of E between corners the equation 

( 6 - 6 ) =/. 

is satisfied, and at each corner of E the expression f — y'fy' — z'f*' has right aitd 
left limits which are equal. 

In equations (6-5) and (6-6) the primes indicate derivatives with re- 
spect to X as before. Equation (6 • 6) and the last statement of the theorem are 
easily deducible from equation (6 *5). Equation (6 -6) is also deducible directly 
from Euler’s equations of Corollary 61 when the arc E has second derivatives 
y", z" on its sub-arcs between corners. 

7. Families of extremals. The differentiability property of an admissible arc 
E satisfying the condition I, proved in Corollary 6 -3 of the last section, is im- 
portant because it shows that the Euler differential equations differentiated 
out in the form 

^ /y'»“l" fvvy fyz'^ fy~^ i 

^ + .A'y^' + + /,vy" + /,'/ z" - /* = 0 

are satisfied by every sub-arc of E along which the determinant (6 • 3) fy*yft>z’ — 
is different from zero. An arc on w hich th is determinant is different frgm 
zero will be called a non-singular subSrc of . The application of the corollary 
iiTfhe fact that the derivatives dfyfdx and df/Jdx along £, occurring inj 
Euler’s equations, can be expressed in the expanded form given for them inj 
the last equations when it is known that the functions y{x), z{x) defining E 
have second derivatives. 

An admissible arc defined by functions y{x), z{x) having continuous first and 
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second derivatives, and satisfying the equations (7 ■ 1 ), is called a n extr emal. 
From the results just stated it is evident that every non-singular sub-arc, 
without comers, of a minimizmg arc E, is necessarily an extremal. 

The differential equations (7 • 1) are of the second order; and the totalit y 
of their solutions wUli therefore, in general, fo rm a four-pa ramet er family of 
mirves d efined by fimctions of the form 

(7-2) yix, a, b, c, d) , z{x, a, h, c, d) . 

For a particular case the family (7-2) is to be found by any available method 
of integrating equations (7 - 1). It is frequently helpful to know that, when one 
of the variables », y, z is not present explicitly in the integrand function /, a 
first integral of Euler’s equations can be written down at once. If x is not pres- 
ent, equation (6-5) or the readily provable relation 

- .'/.■) - / (/.-s ^■■)+ 

shows that a first integral is 

I z = constant . 

If is not present, a first integral is fy* = constant, and a similar remark holds 
for z. 

Near a non-singular extremal arc £, equations (7 1), linear in y" and z", 
have solutions for y", z" of the form 

(7-3) y" = .4(a;, y, z, y', z') , z"=i5(a;, y, z, y', z') ; 

. and the functions i4, B have continuous partial derivatives of at least the order 
I w — 2 when the integrand function / has continuous derivatives of the mXh 
order. The existence theorems for differential equations^® tell us that these 
equations have one and but one solution through each initial element {x, y, z, 
2 ') = (f, rt, f, 17 ', fO in a sufficiently small neighborhood of those on the 
arc £. The functions defining these extremals have the form 

(7-4) ip{x, T 7 , f, 77 ', f'), V, f, >?', f'), 

and the functions have continuous partial derivatives of at least as 

high orders as those possessed by A and £ in a neighborhood of the sets (x^ ij, 
belonging to JE. We can now prove the following theorem: 
^-^ HTheorem 7 1. The Imbedding Theorem. E very non-singular extremal arc 
E ts imbedded for values xi ^ x ^ X 2 , ao, bo, Co, do in a four-parameter family of 
extremals 

(7-5) y(«, a, 6 , c, d) , z («, a, i, c, d) 

See, e.g., Bliss, Princeton Colloquium Lectures on Mathematics (1913), p. 86; Goursat- 
Hedrick, A Course in Analysis, Vol. II, Part II, chap, ii; or, preferably, the Appendix, pp. 269- 
83 of the present volume. The theorems in the last reference are designed especially for use in 
the theory of the calculus of variations. 
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whose functions y, z, y*, z* have continuous partial derivatives of at least the second 
order^^ in a neighborhood of the sets (x, a, b, c, d) belonging to E. Furthermore^ the 
determinant 


ya 

yh 

Vc 

yd 

Z. 

Zh 

Z« 

Zd 

yax 

yhx 


ydx 

Zax 

Zffx 

z« 

Zdx 


is different from zero along E. 

The first part of the theorem is evident if we substitute for f in equations 
(7 *4) a fixed value xo on the interval X\X 2 and change the notations ri, f, ri\ f' to 

Cy d. 

It remains only to show that the determinant (7 6) is different from zero. 
The fact that the solution (7 -4) passes through the initial element (f, rj, ry', f') 
is expressed by the equations 

= f, V, f, ri'y f f, Vy f, »?', f ') , 

= £, riy f, ry', f ') , = Vy f, >?', f ') , 

which are identities in these variables. By differentiating them successively 
with respect to iy, f, ry', f' and remembering that 7y, f, ry', f' are now to be 

replaced by »o, a, b, c, rf, it is seen that the de termb^t X7;j) __has at jg « xp 

^e value unjty^ We shall see in Section 27 that, if for a family of the form (7 • 5) 
\ I this determinant is different from zero at one value of x, it remains different 
I'from zero everywhere on the arc E. 

7 *2. Through an arbitrarily selected point 0 of a non-singular ex- 
tremal arc E there passes a two-parameter family of extremals 

(7*7) y (x, a, /S) , 2 (x, a, j3) 

containing E for values xi ^ x ^ X 2 , ao, and such that the functions y, z, y*, Zx 

belonging to the family have continuous partial derivatives of at least the second 
order in a neighborhood of the values (x, a, j3) belonging to E. Furthermore^ the 
family can be so chosen that the matrix 

ya Za yax Zax 

(7-8) 

yo Z0 ypx Z0X 

has rank two at every point of E. 

The extremals of the family (7 • 5) which pass through the arbitrarily selected 
point (xoy yo, Zq) on E form such a family. The equations of the family can be 
found by substituting xq, yo, Zo for iy, f in equations (7 *4) and denoting ly', f ' 
by a, p. The rank of the matrix (7*8) is then evidently two, since the matrix 

We can replace “second order” by “order w — 2” if the integrand / is assumed to have 
continuous partial derivatives of order m with m> 2, 
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consists of the last two columns of the determinant (7 - 6 ), which is different 
from zero along £, The theorem can also be established by starting with 
any family (7 • 5) with the properties described in Theorem 7 • 1 and solving the 
equations 

yo«yUo, a, c, d) , 2 o= 2 (:ro, o, c, d) 

for two of the parameters a, J, d in terms of the others. 

If the functions (7 S) or (7 *7) are substituted in equations (7 *3), it is evi- 
dent that the second derivatives y*x, also have continuous partial derivatives 
of the second order. Furthermore, if the integrand function / has continuous 
wth derivatives in the region R of Section 3, then the functions B\n equa- 
tions (7 • 3) have continuous partial derivatives of order w — 2, and the same is 
true of the six functions y, 2 , yx, Zx, y«, z** belonging to the families (7 • 5) and 
(7-7). 

8 . Auxiliary theorems. In a number of situations to be encountered later it 
will be necessary to consider the variation of the value of the integral I taken 
along a variable arc E whose end-points 3 and 4 describe two fixed curves C 
and Dy as shown in Figure 8 1. The functions defining £ may be taken in the 



form y{Xy a), z{xy d), the displacement of E being caused by variation of the 
value of the parameter a. If / is a parameter defining the position of the point 
3 on C, then the coordinate xz of the point 3 and the value of a defining the arc 
E through 3 are functions of /, and the functions defining C may be written 
in the parametric form 

( 8 - 1 ) *3 (0 , y [Xi (t) , a (0 ] = ys (0 , » [*a (0 , o «) 1 = (0 . 

Since the point 4 on D is also determined when t is given, and by the same 
value of ^ as that corresponding to 3, it follows that D will be defined by func- 
tions of the same form as those for C but with the subscript 3 replaced by 4 
wherever it occurs. 

In order to carry through the analysis of the following paragraphs, it is 
assumed that the functions xz(i)f XiCO, defining the arcs C and D described 
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above, have continuous derivatives on an interval ^ Furthermore, 

for the values (x, a) specified by the conditions 

*3 (0 S ^ *4 (0 , « = a (0 t") 

the functions y{x, o), z{x, o) define admissible arcs without comers, and in a 
neighborhood of these values (x, a) they and their derivatives y»(a:, a), z,(af, a) 
have continuous first partial derivatives with respect to a. 

The value of the integral I taken along the arc £ is a function IQ) defined 
by the equation 

/ *4 

fix, y{x, a), z{x, a), y’ (x, a), z' (*, a)]dx, 

-I 

in which xz, Xa, a are the functions of / just described. The derivative of this 
function is 

( 8 - 2 ) /'(/) =‘[f^]* + ^£yf»ya + f.Za + fy'ya + f.’ Z:\dx, 


where the arguments of / and its derivatives are those shown in the integrand 
of /(/) and where the first term on the right indicates, as is customary, the 
value of the bracket at the point 4 minus the value at the point 3, If an arc E 
of the family happens to satisfy the Euler equations 


A. 

dx 


/«'-/v=0. 


SO that along it 


fyVa +fvya- (/v' y«) . /. + /»' *« - rfa; (/*' *») • 


then for that particular arc E the derivative (8 • 2) has the value 

-Is ■ 

From equations (8 1) it follows that along C 

(8-3) dyz = y'dxz + yada , d Zs = ^^dxz +Zada , 

and similar equations with subscripts 4 hold along D. Hence, when the values of 
yada/dt, Zadajdt are substituted from these equations, the following thearem is 
justified: 

Theorem 8*1. The value of the integral I, taken along a variable arc E with 
the continuity properties described above and whose end-points 3 and 4 describe 
two fixed curves C and D, has the differential 


(8*4) 


[fdx+ (dy — y'dx) fy*+ (dz— z'dx) f/] I 
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I at each position of the variable arc at which that arc satisfies Euler^s differential 
equations. In the expression for dl the values x, y, z, y', z' occurring in f and else- 
where are those belonging to Eat the points 3 and 4, and dx, dy, dz are differentials 
belonging toCorE, 

It is evident that the differential of I can be easily calculated, from the ex- 
pression (8 *2), for a position of the variable arc which is not an extremal. The 
case signalized in the theorem is, however, the important one. 

As a special case one should note that if the arc C or D degenerates into a 
point the formula of the theorem is still valid, the differentials dx^ dy^ dz 
along the degenerate arc being all zero. 

When the arc E satisfies Euler’s differential equations in every one of its 
positions, the formula of the theorem justifies a further interesting result. 
The functions y{x, a), z{xy a) in the second member of the formula and their 
derivatives y'(a?, a), z'ix^ a) with respect to x are all functions of / calculable 
with the help of equations (8 1). The differentials 'dxy dy^ dz are, furthermore, 
functions of t multiplied by dt^ defined by equations (8 *3). Hence the expressions 
.on the two sides of formula (8*4) are functions of t multiplied by dt and can 
be integrated with respect to t from a value /' defining the position £56 of the 
arc £ to a value defining £ 78 , as shown in Figure 8 * 1. If the notation /* is 
used for the integral 

(8-5) /*=/ {dy-ydx)Jy>->t{dz-z'dx)fA, 

the result of the integration can be expressed as in the following corollary: 
Corollary 8*1. If the ends of a variable extremal arc E describe two curves C 
and D, Ihe difference between the values of I at two positions Ese and E 78 of E, 
shown in Figure 8 • i, is given by the formula 

(8*6) / (£73) - / (£56) = /* (iDw) - (C57) , 

where I* is a notation for the integral {8 5), 

The integral /*, which occurs frequently in the theory of the calculus of varia- 
tions, was introduced by Hilbert and is usually designated by his name. In cal- 
culating its value along the arc Cs? in the formula of the corollary it should be 
remembered that the element (x, y, z, y', z') occurring in its integrand is that 
of the movmg extremal at its intersection with C 67 . The coordinates of this ele- 
ment are therefore functions of t. The differentials dx, dy^ dz, on the other hand, 
are those of Css itself. Similar remarks hold for the integral I*{Dts). 

9. The necessary conditions of Weierstrass and Legendre. The necessary 
conditions on a minimizing arc found in Section 6 were deduced by means of 
sp^ial variations Sy — ari{x), Sz ~ al{x), \^se derivatives with respect to x 
\ evidently approach zero with the variations’memselves when the parameter a 
' approaches zero. Weierstrass^ noticed that not all variations have this prop- 

^ Op, cit,f pp. 210 ff. 
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erty; and by c onstructing a family of arcs contaming the minimizing arc E, but 
wh ose directions do not everywhere approach those of £, he was able to prove 
the necessary condition which is to be deduced in this section. 

Let 3 be an arbitrarily selected point of the arc E, and let 4 be a second point 
on E at the right of 3 but so near to 3 that there is no comer of E between 



them. Through the point 3 an arbitrary admissible arc C may be chosen, 
defined by functions of the form 


F(x), Z(x) 


(xs^x^x+e) . 


It is then possible to join the point 5 of C to the fixed point 4 by a one-parameter 
family of arcs £ 54 , one of which is the segment £34 joining 3 with 4, The family 
of arcs 


(*) +[ 


F (o) — y (g) 
*4 — a 


J (*4 - *) , z (*) + [' 


Z ( g) — z (g) 
Xt — a 


] (» 4 -*) > 


in which a = ocs is the parameter, is an example of such a family. If the original 
arc £12 joining the points 1 and ^ is a minimizing arc, then the sum 

(91) /(Css +£ 54 ) = Pfix, F, Z, F', r)dx+I{E,,) 

•'*1 


I is a function of the coordinate ofs, and this function must not decrease as the 
point 5 moves away from 3 on the arc C. Its differential with respect to jcs 
must,^ t^refprejL nc^t beji egative w hen 5 is at J. But since the arc £34 satisfies 
Euler’s differential equations, it is possible to calculate the differential of 
/(£ 64 ) when 5 is at i by means of the auxiliary theorem of the last section. The 

differential of the whole sum above is then readily found to be the value at 3 of 

the expression 

fix, y, z, F', Z')dx-fix, y, z, y\ z') dx 

- idy-y'dx) fy' {x, y, z, y', z') 

- (dz- z'dx) (x, y, z, y', z') , 

where *, y, z, y', z' belong to E and where the differentials dx, dy, dz are those 
of C. The first term of this expression is the differential of the ^t integral in 
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the second member of equation (9 1), since F = y and Z = s at the point 3; 
and the other terms are the differential of I{En) at the point 3, as given by 
Theorem 8 1. With the help of the notation 

E (*, y, 2, y . y, Z') =• / (*, y, z, Y’, Z') - f (*, y, s, y', s') 

- ( F' - y') (at, y, s, y', s') - (Z' - s') {x, y, s, y', s') 

the differential of /(C35 + £54) when 5 is at J can now be expressed in the 
form 

E{x, y, s, y', s', F', Z')d*l» . 
and the following theorem is proved: 

II . The N ecessary Condi tion of Weierstras s^^w admissible arc E is 
said io satisfy the condition [I of W tier sir ass if at every element (x, y, z, y', z') 
of £ the condition 

E(x, y, s, y', s', F', Z') ^ 0 

is satisfied for every admissible element (x, y, z, Y', Z') (Y', Z') (y', z'). 

£wery E which minimizes the integral I must satisfy the condition II. 

The function (9-2) is called the £-function of Weierstrass. It evidently 
-^vanishes when the pair (F', Z') coincides with (y', z'). The exclusion from the 
theorem of the case when these two pairs are equal is made because in later 

( paragraphs it will be desirable to strengthen the condition £ ^ 0 by excluding 
the equality, and this can be done only for pairs (F', Z') which are different from 

The proof of the necessary condition of Weierstrass given above applies to 
all elements y, z, y', z') of a minimizing arc not belonging to a backward 
tangent at a comer. But if the condition holds at all elements preceding a comer, 
it must hold also, by continuity, at the element belonging to the backward 
tangent. 

There is a further important necessary condition on a minimizing arc, due to 
Legendre,^® which is an easy consequence of that of Weierstrass. It was original- 
ly proved in a very different manner and is much older than the condition II. 
By means of Taylor’s formula with a remainder term the £-function is expres- 
sible by the equation 

2£(x, y, z, y', 2', F', Z') - ( F' - y') V.V 

+ 2 ( F' - y') (Z' - «') fy'.' + (Z' - z') V.'*' , 
where the second derivatives of / have arguments of the form 
(9-4) *, y, 2, y' + (j(F'-y'), 2' + 9(Z'-2') (0<9<1). 

If F' and Z' are replaced by the values 

F'-y'+cij, Z'-s'+«f, 

^ See Ostwald, op. cit., No. 47, pp. 56 ff. 
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where jy and f are arbitrary real constants, and if e is then made to approach 
zero, one finds at once the following further condition: 

III. Legendre’s Necessary C onditio n. An admissible arc E is said to 
siUisfylKfcond^iidi^^ if at each element (x, y, z, y', z') of E the con- 

dition 

(9 5) ^VyV + 2 ^ 0 I- 

is satisfied for every pair of real values ri^ f such that = 1, the arguments 

of the derivatives of f being the coordinates (x, y, z, y', zfYqflhe element of E. Every 
arc E which minimizes the integral I must satisfy the condition III. 

The restriction = 1 is imposed because it will be desirable later to 

use a stronger condition derived from III by excluding the equality sign in 
(9-5). If the condition in the theorem holds for + f ^ = 1, it will hold for all 
real values iy, f , as one readily verifies. 

The conditions deduced from the first variation in preceding sections are 
valid for both minimizing and maximizing arcs, but this is not true for the con- 
ditions of Weierstrass and Legendre. The inequalities in the statements of these 
latter conditions, as given above, must be changed in sense for a maximizing 
arc, as one easily sees by examining the proofs. The situation is quite analogous 
to that in the calculus when maxima and minima of a function f(x) are being 
studied. The condition f(a) = 0 must hold at a value x a defining either a 
maximum or a minimum; but at a minimum the second derivative condition 
is /"(a) ^ 0, and for a maximum this inequality must be changed in sense. 

From the proof of Legendre’s condition III given above it is evident that 
III is a conseq uence of th e conditionjl jDf Weierstrass, but the converse is no^ 
Jwaystm eTThe consequence of III in the following corollary is not the same 
as II, but it will be useful in the proof of one of the sufficiency theorems which 
will be given in a later section. 

Corollary 9 1. If the condition 

(9-6) ryVyV + 2r7f /yV + > 0 

is valid at every element (x, y, z, y', z') of an arc E12 for all values ty, f such that 
T?* + f ® = 1, then the inequality 

Eix, y, z, y', z', F', Z') > 0 ^ 

will be satisfied at least for all elements (x, y, z, y', z') and (x, y, z, Y', Z') lying in 
a sufficiently small neighborhood N of those on Ew and having (Y', Z') ^ (y', z')« 

The proof is simple, since, if the inequality (9 *6) holds at every one of the 
close^set of elements (au, y, z, y', z', 17, f) for which 17* + f * = 1 and [x, y, z, y', z') 
belongs to £12, it will also hold when the arguments of the derivatives of/ are re- 
placed by the set (9*4), provided that (x, y, z, y', z') and (x, y, z, F', Z') re- 
main in a sufficiently small neighborhood of the set of elements of this type on 
£12. The formula (9*3) then establishes the desired result. 
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10. Envelope theorems and Jacobi’s necessary condition. Besides the three 
necessary conditions on a minimizing arc which have been proved in preceding 
sections, there is a fourth condition, the necessary condition of Jacobi, which re- 
mains to be discussed.^^ Historically the order of discovery of these necessary 
conditions was the Euler condition in 1744, Legendre’s condition in 1786, 
Jacobi’s condition in 1837, and the condition of Weierstrass about 1879. The 
proofs of the conditions given here are quite different from those of the original 
discoverers, except in the case of the necessary condition of Weierstrass, and it 
is for this reason that the order of presentation chosen is not that of discovery. 
The proof of Jacobi’s necessary condition to be given in this section depends 
upon a so-called “envelope theorem,” which seems to be the contribution of 
Darboux (1894) for the case of geodesic lines on a surface and of Zermelo (1894) 
and Kneser (1898) for more general problems of the calculus of variations.^® It 
is one of the most interesting and most beautiful theorems in the domain of 
geometrical analysis. 

A proof of the envelope theorem can readily be made with the help of Corol- 
lary 8-1. A variable extremal arc was there considered whose end-points de- 
scribed two curves, C and D. For two positions, Eu and £56, of the variable arc 
the values of the integral I were found to be related by the formula 

(101) I (Ese) - I (Eu) « /♦ (Z>46) - (C35) , 

where /* represents the Hilbert integral (8 5). In the special case when the 
curve C is a fixed point 1 and the variable extremal arc E is in every position 



tangent to the curve D, as shown in Figure 10 - 1, the last formula justifies the 
conclusion that 

/(£ie) -/(£u) =/(Z>46), 

since the term in formula (10* 1) must in this case be replaced by zero 

and since the term I*{D^ is seen to be equal to I{D^ because the direction 
dx:dy:dz of the curve D coincides with the direction I:y':z' of the variable ex- 

See Ostwald, op, ciL, No. 47, pp. 87 ff. 

Darboux, ThSorie des surfaces^ III (1894), 88; Zermelo, **Untersuchungen zur Variations- 
rechnung*’ (diss., Gattingen, 1894), p. 96; Kneser, MathemaHsche Annalen, L (1898), 27, and 
bis Lekrhuch der Variationsrechnung, 1st ed, (1900), p. 93, 2d ed. (1925), p. 116. 
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tremal arc E at their intersection. Thus the following remarkable theorem is 
justified: 

Tj iEOREM: 10 - 1. T he Envelope Theorem. If a one-parameter family of ex- 
tremal arcs E has an envelope D, as shown in Figure 10 - ly then the equation 

(10*2) I (El,) =/(£i4) +/(Z>46) 

holds for every position of the point 4 preceding the point 6 on D. 

The envelope theorem is analogous to the well-known string property of the 
evolute of a curve. An evolute D of a curve C, shown in the first diagram in 
Figure 10 *2, has the property that the end i of a string E will describe an arc C 



orthogonal to E when its other end 6 is fixed on D and the string is caused to 
wrap itself around D, In other words, the length of the composite arc Ez\ + 1^46 
is equal to that of Ese for every position of the point 4 on D at the left of 6, There 
is an envelope theorem slightly different from that in Theorem 10*1, which has 
a still closer analogy with this so-called ‘‘string property.^^ A direction dx\dy\ dz 
is said to be transversal to the direction i : 2 ^ at a point (x, y, z) if it makes the 

integrand 

, f^^. 2, y\ z^)dx+ (dy-ydx) fy> (Xy y, z, y\ z') 

(10-3) 

+ (dz - z^dx) f^> (Xy y, z, y', z') 

of.Hilbert^s in tegral vanish. W ith this definition, and by a slight modification 
of The prooToTthe last the^m, the following result can now be established: 

Theorem 10 *2. If each arc E of a one-parameter family of extremals is cut 
transversally by a curve C and if the family has further an envelope D, as shown in 
the second diagram of Figure 10 then the equation 

I(E„) =/(£34) +/(Z>46) 

holds for every position of the point 4 preceding the point 6 on D. 

The modification in the proof of Theorem 10* 1 in order to establish Theorem 
10-2 is simply that the term I*{C^^ in equation (10- 1) vanishes for the latter 
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theorem because C cuts the arcs E transversally, and not because C reduces to 
a point. For the length integral in space for which / = (1 + y'* + it is easily 

proved that transversality is orthogonality and that the extremals are straight 
lines. Hence the string property of the evolute is, in fact, a special case of Theo- 
rem 10*2. 

A contact p oint 6 of a n ^tremal arc £12 with an^envelope D , such as is^hown 
in ji^re tolbe^ j ^oinrcoi^ggfe to 1 o n the axc£^. In the next 

seSioiTa second d^nition ofaconjugate point is givenj'andlirSection 13 it is 
shown that the conjugate points 6 just described are included in this latter defi- 
nition. In terms of the notion of a conjugate point the necessary condition of 
Jacobi can now be stated as follows: 

IV. Ti y:oBi^s Necessary Condition. A non-singular extremal arc E 12 is said 
to satisfy the condition IV of Jacobi if it has on it between 1 and 2 no point 6 con- 
jugate to 1. Every non-singular minimizing arc E 12 without corners is an extremal 
arc satisfying this condition. 

The proof of the theorem is simple. In the first place it should be noted that 
the minimizing or maximizing arc £12 is non-singular and therefore an extremal 
arc, as stated in the second paragraph of Section 7, and it is the only extremal 
arc through one of its elements {Xy y, z, y', z') = ({, 17 , f, ry', f')- K there is a con- 
jugate point 6 on £12 of the kind shown in Figure 10- 1, then formula (10*2) 
shows that the composite arcs £14 + I >48 + -Eez all give to the integral I the 
same value as £ 12 , whatever the position of the point 4 preceding 6 on the enve- 
lope D. If /(£i 2 ) were a minimum or maximum, these composite arcs near £12 
would therefore also be minimizing or maximizing arcs, non-singular near the 
point 6y since £12 is so. In that case would have to be an extremal arc, which 
is impossible, since the only extremal through the common element (a:, y, z, y', z') 
of D 46 and £i 6 at the point 6 is £12 itself. 

The proof of Jacobi’s condition in the preceding paragraph applies only to 
conjugate points 6 which are contact points of an envelope D with a branch 
projecting backward toward the point i. It would fail, for example, in the not 
uncommon case^* when the envelope has a cusp at 6 with both branches receding 
from i. In the next section a more inclusive definition of conjugate points will be 
given, for which Jacobi’s condition, as stated above, can also be proved. For the 
conjugate points considered here the following stronger conclusion can be de- 
duced: 

Theorem 10-3. On a nonrsingidar minimizing a rc Ew wi thout comers there 
cannot be a co njugate point 6 of the type shown in Figure 1 0 • 1, with a branch of the 
envelope lyprojectink backward from 6 toward the if pint 1, either between 1 and 2 
or at 2._ 

An examination of the proof given above justifies this statement at once. It 
is easy to see that the condition IV and Theorem 10-3 apply to maximizing, as 
well as to minimizing, arcs. 

» For a discussion of the possible cases in the plane see Bolza, op. cit.y pp. 358 
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11. A second proof of Jacobi’s condition. Let £12 be a non-singular minimiz- 
ing arc without comers for the integral /, so that, by definition, the determinant 
/y'v'/*'*' (/v'»0* is different from zero along it. According to the differenti- 

ability condition of Hilbert in Corollary 6 *3, such an arc must have continuous 
second derivatives and be an extremal. It is only to such non-singular extremals 
that the Jacobi condition, deduced in the last section and here, applies. 

The proof of Jacobi’s condition which will be made in this section^’ depends 
upon the fact that the second variation 

liin, ?) = / '2baix, 71, f, 1/', ^')dx 

•'*1 


of Section 4, formed for a minimizing arc £ 12 , must be greater than or equal to 
zero for all admissible variations ??, f vanishing at Xi and X 2 . A pair rj, f making 
h(Vy f) zero must therefore define an arc which minimizes this second variation 
^in the class of admissible arcs in iciyf-space which join the points (aji, 0, 0) and 
(x 2 , 0, 0). A minimum problem for h is thus suggested which is similar in type to 
that originally proposed for the integral /, but in xiyf-space instead of a;yz-space. 
It is called ^e accessory minimuin problem, a nd the word “accessory” will fre- 
quently be used to denote its extremals and other concepts connected with it. 

The extremals of this new problem are the solutions ry, f of the differential 
equations 


( 111 ) 


A 

dx 


w,' — w, = 0 , 




Since 2co is the quadratic form in ly, f , iy', f ' whose coefficients are displayed in 
Section 4, it is readily seen that these equations are linear and homogeneous in 
the variables ly, f , ly', f ', ly”, f Furthermore, the determinant of coefficients of 
the terms in ry", f" is /y'v/*'/ — (/yv)* 5 ^ 0. Hence the equations are solvable 
for ry”, f”; and it follows, as stated for Euler’s equations in Section 7, that they 
have one and but one solution through each initial element (xo, lyo, fo, »yo', foO- 
In particular, the unique solution through (xo, voj f 0 , ^ 0 ', f o') = (^co, 0, 0, 0, 0) 
is 7y = 0, f = 0, as one readily verifies by substituting these functions in the 
equations. 

The equations (11-1) are sometimes called “Jacobi’s differential equations,” 
since for analogous problems their solutions were first utilized by him;^® but in 
the following pages they will be designated as the^cessory system of differential 
equations in accord with the nomenclature described above, and also with one 
introduced by von Escherich^® in his studies of more complicated problems. 

First given in Bliss, “Jacobi’s Condition for Problems of the Calculus of Variations in 
Parametric Form,” Transactions of the American Mathematical Society^ XVII (1916), 195. 

See Ostwald’s Klassikery No. 47, pp. 88 ff.; or Jacobi’s original publication, “Zur Theorie 
der Variationsrechnung und der Differentialgleichungen,” Journal fur die angewandte Mathe- 
matiky XVII (1837), 68. For further references see Bolza, op. cit.y p. 60. 

1* <*Die zweite Variation der einfachen Integrate, ” Wiener BerichtCy CVIl (1898), 1236. 
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Definition of a Conjugate Point. A point 6 is said to be conjugate to 
the point 1 on an extremal arc E12 if there exists a solution (17, f) = (u, v) of the 
accessory equations with elements u, v vanishing at xi and xe but not both identically 
zero between xi and Xe. 

This is a somewhat artificial definition whose significance will appear later. 
In particular, it will be shown that the contact points 6 of envelopes D described 
in the last section are conjugate to i in the sense defined here. 

To prove Jacobi’s condition IV for the new type of conjugate points, let 6 
be such a conjugate to 1 between 1 and 2 on E12, having a pair of functions «, v 
as described in the definition above. An admissible arc in x»7f-space with a corner 
at x^ can then be defined by the equations 

f) = for Xi^x^Xf^, 

( 11 * 2 ) 

(^, ?) = (0, 0) for x^^x^X2. 

With the help of equations (11 1 ) and the well-known and readily verifiable 
homogeneity property of Euler for the homogeneous quadratic form 2 co, 

( 11 * 3 ) 2 w= + + + > 

it is clear that the variations 77, f defined above give to the second variation the 
value 

i) — f * [T 7 w, + f«r + 77 'wv + do: 

•'*1 

= + J ; 

and this is zero, since the functions (77, f) = (w, v) vanish at xi and ace. The 
arc (11 • 2) in a(:77f-space would therefore minimize the second variation 72(77, f), 
when Ei 2 is a minimizing arc for the original problem in afyz-space, and would 
have to satisfy the Weierstrass-Erdmann corner condition of Corollary 6 -2 for 
the second variation at the point tf. Since w,,' and wf' vanish at the right of x% 
and have the values 

+ , tor = [/zV«' + 1® 

at — 0, this implies that u* and would vanish with u and v at ai;^ and hence 
that u and v would vanish identically, which is contrary to the definition of 
a conjugate point. Hence, when the arc Ei% minimizes 7 , there can be no point 
6 conjugate to 7 between 1 and 2 on £12. 

The proof of Jacobi’s condition, which has just been given, has the advantage 
that it applies to all conjugate points irrespective of the character or existence 
of an enveloping curve D through them. It does not, however, give any informa- 
tion about the case when the point 2 itself is conjugate to 7 , even when there is 
an envelope D through 2 with a branch extending toward 7 , as described in 
Section 10 . The methods of this and the preceding section are evidently supple- 
mentary in that each gives some information which the other fails to justify. 
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12. The determination of conjugate points.*® Four solutions »,) (i = 1, 
. . . , 4) of the accessory differential equations (11*1) of an arc £12 have a de- 
terminant 


Ui 

«2 

Us 

Ui 

Vi 

V2 

Vs 

Vi 

/ 

/ 

t 

/ 

Ui 

U2 

Us 

Ui 

/ 

/ 

/ 

t 

Vi 

V2 

Vs 

Vi 


which is either identically zero or else everywhere different from zero. For, 
when this determinant vanishes at one value of x, say at JCo, the constant co- 
efficients in the linear expressions 


( 121 ) 


U = C1W1+ C2tt2 + C3W3+ C4W4 , 
V- C 1 V 1 + C2V% + + 


can be determined so that not all of them are zero, and, furthermore, so that 
w, V and their first derivatives all vanish at Xq. But the pair (w, v) is also a solu- 
tion of the accessory equations, since these equations are linear; and hence u and 
V must vanish identically in x, as explained in the preceding section. The de- 
terminant above is then evidently identically zero if it vanishes at a single point, 
and the solutions Vt) are linearly dependent. 

A set of four solutions («,*, v,) with determinant d(x) different from zero is 
called a ‘‘fundamental system of solutions of the accessory equations.*' Every 
other solution («, v) is expressible in terms of them in the form (12 • 1). This fol- 
lows since for a solution («, v) selected arbitrarily the equations (12 ■ 1) and their 
derivatives with respect to x can be solved at a fixed value Xo for the constants 
c*. The solution 

U — CiUi — C2U2 — Cztiz — , 


V — CiVi— C2V2’- CzVz-- CiVi 

of the accessory equations then vanishes with its derivatives at Xo and hence is 
identically zero. 

Theorem 12*1. 1//or four solutions (ui, v,) of the accessory equations of a non- 
singular extremal arc E 12 the determinant 



«2 ix) 

Ui (x) 

«4 (*) 

Vlix) 

Vi{x) 

tl3 (*) 

Viix) 

«i (* 1 ) 

Ui (ati) 

Ui (aci) 

«4 (*l) 

»i (* 1 ) 

Vi (*i) 

Vi (*i) 

Vi (xi) 


*® For the properties of solutions of linear differential equations deduced in Section 12 one 
may consult also the references in the footnote of p. 68, below. 
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is not identically zero, then the points 6 conjugate to \ on E 12 are determined by the 
zeros X6 of D(x, xi). 

To prove this, let 6htz, point conjugate to 1 on £ 12 . The solutions («,*, w,) in 
the determinant D{x^ are necessarily linearly independent and form a funda- 
mental system, since otherwise the determinant D would surely vanish identi- 
cally. Hence a particular solution (m, v) characterizing the conjugate point 6, 
as in the definition of a conjugate point in Section 11, would be expressible in 
terms of the solutions («»•, v,) in the form (12 * 1). But since the elements w, v of 
such a solution must vanish at x^^ it follows that D{x^^ xi) = 0. 

Conversely, suppose that rce is a zero of the determinant D{x, x^. Then the 
constants d in the expressions (12 • 1) can be determined so that u and v vanish 
at both x\ and x^. The functions «, v so determined cannot vanish identically, 
since the solutions w,-, Vi are linearly independent; and hence xe defines a point 6 
conjugate to 1 on the extremal arc £ 12 . 

The determinants D{x^ xi) not identically zero are those for which the solu- 
tions Vi of the accessory equations are linearly independent. For, if the u^, 
are linearly dependent, the determinant evidently vanishes identically; if they 
are linearly independent, then D(x, xi) is not identically zero, since it has the 
form D(Xy Xi) = (x—XiyA(x, Xi) with A(xi^ xi) = d(xi) ^ 0, as indicated in the 
proof of Theorem 12 • 2 below. 

The following theorem is an easy consequence of the preceding paragraphs 
and is useful in the proofs of the sufficiency theorems of Chapter II. 

Theorem 12 -2,1/0 non-singular extremal arc E12 hcLS on it no point conjugate 
to 1, then the points 0 sufficiently near to 1, on the extension of E 12 to the left of 1, 
have also no conjugates on E 12 . 

To prove the theorem one may first subtract the third from the first row in 
the determinant D{x^ x^ of Theorem 12 - 1 and apply Taylor's formula with in- 
tegral form of the remainder. The differences in the first row of Z?(x, xq) then 
have the values 

J rl / 

' Ui[xo-\-e{.x — Xf,)]de (t=l,..,,4). 
0 

After a similar process has been applied to the second and fourth rows, it is clear 
that D(», xo) is expressible in the form 

D(*, * 0 ) = (* - * 0 ) (*, * 0 ), 

where ^(«i, «i) The function A{x, xi) is different from zero at x == Xi, 

since d(xi) 0; and it is different from zero on the rest of the interval Xi^ x ^ 
xt since 1 has no point conjugate to it on £ 12 . It follows readily by continuity 
considerations that A{x, Xo) will be different from zero on ^ x ^ Xi if Xo < xi 
is sufficiently near to xi. Furthermore, JD(x, xo) will also have this property, and 
the theorem is proved. 

Besides the method of determining conjugate points in Theorem 12 - 1, there 
is a second one, which may be described as follows: 
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Theorem 12 -3. If the columns of a determinant 


A(x) = 


Ui 

Vi 


U2 

V2 


are two solutions of the accessory equations of a non-singular extremal arc E 12 , with 
all elements Ui, vi (i = 1,2) vanishing at xi, and if the determinant itself is not 
identically zerOy then the points 6 conjugate to 1 on E 12 are determined by the zeros 
X6 of A(x). 

The proof is much like that of the criterion of Theorem 12 1 and need not be 
given in detail. It is necessary to know, however, that every solution («, v) of 
the accessory equations, with elements «, v both vanishing at xi, is linearly ex- 
pressible in terms of the solutions («„ »,) of the determinant A{x) of Theorem 
12 *3 in the form 


( 12 * 2 ) U=CiUi + C2U2, V=CiVi+C2V2. 

A first remark, in order to prove this, is that the determinant 


(12-3) 




Ui 

U 2 

» 

/ 

Vi 

V2 


is surely different from zero at xi. Otherwise, constants ci and C 2 could be deter- 
mined so that the functions «, v in equations (12*2) would have derivatives 
zero at xi; and since these functions themselves vanish there, it follows that they 
would be identically zero. This is, however, impossible, since A(x) is, by hy- 
pothesis, not identically zero. 

Suppose, then, that («, v) is an arbitrarily selected solution of the accessory 
equations with elements vanishing at Xi. Since the determinant (12-3) does not 
vanish at Xi, constants Ci and C 2 can be determined so that the derivatives of the 
solutions 

u— CiUi’- C 2 U 2 , V — CiVi— C 2 V 2 

both vanish at Xi. But these solutions themselves are zero at xi and hence van- 
ish identically. The remainder of the proof of Theorem 12 • 3 is like that of Theo- 
rem 12 1. 

One of the very striking contributions which Jacobi made to the theory of the 
calculus of variations was his use of the fact that when a family 

(12-4) y(«, a, 6, c, d) , z («, a, b, c, d) 


of extremals is known, such as was described in Theorem 7 - 1, solutions of the 
accessory equations along every member of the family can be found by simple 
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differentiations. To simplify the proof, only the parameter a will be exhibited 
for the moment. The first of the Euler equations 

^ /v' [*. y (* «) 1 2 (*, «) » y' (*. «) . 2' (*. «) 1 

- /» [». y (*, a) , Z (*, a) , y' (*, a) , z' (*, a) ] = 0 
is an identity in x and a. When differentiated with respect to a, it gives 

[ fv^vVa fyt^a 4 “ ly y ya 4 “ Jy z* 2 ^] ^ 

[/vyyo4* JyzZa“\‘ /y»'ya4‘ /y*' == 0 

which is precisely the first of the accessory equations (11 1) with ry, f replaced 
by ya, Za. A similar argument shows that the pair (yo, Za) satisfies the second of 
the accessory equations as well as the first. Furthermore, three other solutions 
of the accessory equations are furnished by the derivatives of the family (12 -4) 
with respect to 6, c, d; and the following corollary is evident: 

Corollary \2’\. On a member E of a four-parameter family of extremals 

(12-5) y{x, fl, h, c, d), z{x, a, 6, c, d) 

the points 6 conjugate to a point 1 are determined by the zeros Xe of the determinant 

ya^^) yb{^) yd^) yd^x) 

Za^X) Zi,{x) Z^{x) Zdix) 

yaixi) ybixi) yc(^i) yaixi) 

Za(xi) Zb(xi) Zcixi) Zaixi) 

provided that this determinant is not identically zero along £. 

It is understood that in the elements of the determinant D the parameters 
a, by c, d have been suppressed for the sake of simplicity in notation. 

From the first paragraph of this section it is now evident that, if the deter- 
minant 

ya yb yc ya 

Z(f Zb Zff Zj 

d(x) = 

y<u, yhx yex yix 

Za» ^bx ^cx ^dx 

is different from zero at one point of an extremal arc £12 of a family (12 *5), it 
will be different from zero everywhere on as was stated in the proof of The- 
orem 7*1. The proof of Theorem 12 -2 shows, therefore, that on an arc £12 of 
such a family the determinant D of the corollary is not identically zero, since 
it has the form D « (r» — xi^A^x, Xi) with A(xi, xi) = d(xi) 3 A 0. 


D(x, xi, a, 6, c, d) = 
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As a final corollary, the following is readily provable: 

CoROLLAKY 12 • 2. Oh-o member E o/ a two-parameter family of extremals 

(12-6) yix, a, 0) , z{x, a, 0) 


each of which passes through a fixed point 1, the points 6 conjugate to 1 are deter- 
mined by the zeros xe of the determinant 


A(ap, a, j3) 


y« M yfi (x) 

Za ix) Zfi ix) 


provided that this determinant is not identically zero along £. 

The columns of a, fi) are solutions of the accessory equations, as has 
been seen above, and their elements all vanish at x\. This follows because the 
extremals (12 -6) all pass through the point 7, and the equations 


yi=y(xi, a, j3), Zi=2(a:i, a, jS) 


are therefore identities in a, /3. Differentiation of these equations with respect to 
a and j3 shows that the derivatives ya, 2#, y/j, all vanish at Xi, 

It happens for some special problems that the extremals can be found in a 
parametric form not easily reducible to the non-parametric one. Under such cir- 
cumstances it is important to have a criterion for the determination of conjugate 
points expressed in terms of the parametric equations defining the extremals. 
Let these equations be 

IT s= X (w, a, b, c, d) , 

(12-7) y= Y{u, a, h, c, d) , 

z — Z {u, a, 6, c, d) , 

The non-parametric equations would then be found by solving the first equation 
(12 • 7) for w as a function 

(12-8) u=U{x, a, 6, c, d) 

and substituting in the last two equations. The determinant D{Xf xi^ a, J, c, d) 
for the non-parametric family so determined is readily seen to be equal to the 
determinant 

Xaiu) XAu) Xciu) XAu) XAu) 0 

Ya{u) YAu) YAu) YAu) YAu) 0 

Za{u) ZAu) ZAn) ZAu) ZAn) 0 

X„(wi) XAux) XAui) XAni) 0 XAui) 

Fa(wi) YAui) YAui) YAui) 0 YAui) 

Zaiui) ZAui) ZAui) ZAui) 0 ZuiUi) 


(12*9) 
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except for a factor XuMXuMy which is different from zero since XyJUt ^ L 
The value U\ corresponds to x\ by means of the equation (12 *8); and if we de- 
note the determinant (12 -9) by D(uj ui, b, c, d), the conjugate points to 1 
on an extremal (12 * 7) will be defined by the zeros Uioi this determinant. 

13. The geometric interpretation of conjugate points. In this section it is 
to be proved that if a non-singular extremal arc En has on it a contact point 6 
with the envelope of a one-parameter family of extremals through the point i, 
then d is a conjugate to 1 in the sense of the analytic definition of a conjugate 
point in Section 11. Conversely, with the help of an additional hypothesis, it 
turns out that every conjugate point according to the analytic definition is a 
contact point of an envelope. 

Let the functions defining a one-parameter family of extremals through the 
point 1 with an envelope D be 

(13*1) y(x, /) , z(x, /) [xi^x^x{t) , 

The family is supposed to contain the non-singular extremal arc £12 for t ^ h; 
and each extremal of the family touches the envelope D, as shown in Figure 
10*1, at the point determined by x = ^(/), so that the functions defining D 
are 

(13-2) x{t), y[x{t), t] « F(Z), z[x{t), t] -Z(Z). 

The functions x{t) and (13 * 1) and the derivatives y„ z, are supposed to have 
continuous partial derivatives of at least the first order in a neighborhood of the 
values (x, t) belonging to the arc £ 12 , and it is furthermore assumed that yt, Zt 
do not vanish identically along £ 12 . The fact that D is tangent at each of its 
points to one of the extremals (13 * 1) is expressed by the equations 

(13*3) «'(/)=X, yx«'+yi = Xy,, z,x' + Zi«Xz,, 

where X is a factor of proportionality and where the arguments of the deriva- 
tives of y and z are x{t)^ U But these equations and the identities in Z, 

yi»y («i, Z), Zi* z(xi, Z) , 

imply that the elements of the solution y<, Zt of the accessory equations vanish 
at both X\ and x(Zo) = on the arc £ 12 . Since yi(x, Zo), Z((x, Zo) do not both van- 
ish identically on the interval the point 6 must be conjugate to i, according 
to the anal}rtic definition of a conjugate point; and we have the following the- 
orem: 

Theorem 13*1. LetEnbeaparticidlarextremalarcconlahiedinaone^parafneter 
family of extremals (13 * 1 ) . If the family has an envelope D, as shown in Figure 10 • i, 
and satisfies the hypotheses of the preceding paragraphs, then the contact point 6 of 
Eis with 13 is a point conjugate to Ion E 12 , according to the analytic definition of a 
conjugate point in Section 11. 
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In order to prove the converse of this result, let the extremal En be contained 
in the two-parameter family of extremals 

(13-4) yU, a, /3) , 2 (x, a, /3) 

of Theorem 7 • 2 for parameter values ao, /3o. All the extremals of this family pass 
through the point 1, Furthermore, let tf be a point conjugate to 1 on £12 defined 
by a zero of the determinant ao, /So). In order to carry through the anal- 
ysis, the additional assumption is made that the derivative A*(a;6, ao, /So) is dif- 
ferent from zero. From the value of the derivative A* as a sum of determinants 
it follows that ya, y^, Za, Z/s do not all vanish at {x^y ao, Po). If y^ is different from 
zero, for example, the first two of the differential equations 

Axdx -f-Aada -j-A/sd/S = 0 , 

(13-5) yada + yfid^ = 0 , 

Zada + Z/jd/S = 0 

can be solved for djc/d/S, da/dP; and they determine uniquely a solution x(0), 
a(/3) through the initial point (jce, ao, /So). The determinant A vanishes identically 
on this solution, since it vanishes at the initial point (xe, ao, /So) and has its total 
derivative with respect to /S identically zero because of the first equation (13*5). 
Hence the last equation (13 -5) is also satisfied identically by x(/S), a(/S). A sim- 
ilar argument can be made if ya vanishes, since one of the three derivatives y^, 
Za, Zfi must then be different from zero at (xc, ao, /So). 

In every case three functions, x(0, a(0, and /S(/), are determined, / being a or 
/S, which take the initial values xe, ao, /So for a value / = /o. On the one-parameter 
family of extremals 

V [x, a (0 , /3 (0 ] = y (Xy /) , 

(13-6) 

2 [Xy a (/) , Pit)] = z (x, t) 

the curve D defined as in (13 *2) by the function x(/) satisfies equations (13 -3), 
since the last two equations (13 • 5) show that the derivatives yi, Zt vanish identi- 
cally along it. It follows readily that the family (13 • 6) is a one-parameter family 
of extremals with an envelope D touching the extremal arc £12 at the conjugate 
point 6, 

Theorem 13 *2. If a point 6 on an extremal arc E 12 is conjugate to the initial 
point 1, according to the analytic definition of a conjugate point in Section 11 y and 
if the further hypotheses of the preceding two paragraphs are satisfiedy then 6 is a 
contact point of E 12 with the envelope Id of a one-parameter family of extremals 
through the point 1. If this envelope has a branch projecting from 6 toward the point 
1, then 6 is also a conjugate point to 1 of the kind discussed in Section 10. 

It should be noted that the envelope D of the theorem may have a singular 
point at the point 6 with no branch projecting backward from 6 toward the point 
1; or, in particular, it may degenerate into the point 6 itself. In these cases the 
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necessary condition of Jacobi cannot be proved, as in Section 10, by means of 
the envelope theorem. 

It will perhaps be interesting to consider further the geometric properties of 
conjugate points by means of somewhat more intuitive arguments. If the values 
of X which make the determinant a, p) vanish on the extremals (13 -4) are 
determined by a function x = X(a, p), then the functions 

X(a, P), ylX, a, p], z[X, a, p] 

determine a surface S, The normal to this surface at each point (a, P) is readily 
seen to be orthogonal to the corresponding extremal of the family (13 -4), and 



Fig. 13.1 


hence the surface is an enveloping surface for the family. One can also verify, 
without difficulty, the fact that 5 has no singular point near the point 6 if the 
matrix 

Aa ya Za 

Afi y$ Zfi 

is of rank two at 6, At each point of S the extremal of the family (13 • 4) tangent 
to S defines a direction, and these directions form a slope field on S similar to 
that defined by a differential equation of the first order in a plane. Hence, 
through each point of S there passes one and but one curve tangent to the slope 
field. The particular curve D of this sort, through the point of contact 6 of £12 
with 5, is an envelope of a one-parameter family of extremals through the 
point 1 of the type discussed frequently in preceding pages. 



CHAPTER II 

SUFFICIENT CONDITIONS FOR A MINIMUM 

14. Introduction. Before the time of Weierstrass, writers on the calculus of 
variations did not recognize the fact that when a number of properties of a 
minimizing arc have been discovered there still may be a doubt as to whether 
these properties are suflficient actually to insure the presence of a minimum. 
The necessary conditions of Euler, Legendre, and Jacobi were published in 
1744, 1786, and 1837, respectively, but apparently without inspiring any 
attempt to prove that an arc satisfying them would surely give the integral 1 
a minimum value. Weierstrass,^ however, after adding a fourth necessary con- 
dition in 1879, proceeded to prove by a very ingenious and beautiful method 
that the four necessary conditions known to him, when suitably strengthened, 
will actually guarantee the minimizing property. 

The sufficiency proofs given in the following two sections are essentially 
those of Weierstrass in their structure, though different from his in some of their 
details. A fundamental notion for his method of proof is the so-called “field 
of extremals.** The fields which he used are for problems in the plane and are 
special in character because all of their extremals pass through a fixed point. 
In Section 18, below, a definition of a more general field is given, applicable 
either in the plane or in higher spaces, and a sufficiency theorem is proved 
which in its interpretations for special problems is sometimes much more 
powerful than the theorems of Weierstrass. It will be noted that the notion of 
a field in a space of more than two dimensions is not merely a notational ex- 
tension of the corresponding notion for the plane. New characteristics are in- 
volved which were first studied by A. Mayer.* In Sections 20 and 22, below, 
and in Chapter III, the properties of Mayer fields are developed in more detail. 

The second variation of an integral of the calculus of variations has been the 
subject of extensive study, and an elaborate transformation theory has been 
devised in order to deduce from it the necessary conditions of Legendre and 
Jacobi. These conditions can now be proved by the simpler methods explained 
in Chapter I, but there remains a voluminous literature for the understanding 
of which a thorough knowledge of the second variation is required. In Section 23 
the essential results of the transformation theory of the second variation are 

1 WerkCy VII, 210-17 and 218-29. For a comprehensive study sec Duren, “The Develop- 
ment of Sufficient Conditions in the Calculus of Variations” (diss., University of Chicago), 
Contributions 1930^ p. 245. 

* “Cber den Hilbertschen Unabhangigkeitssatz in der Theorie des Maximums und Mini- 
mums der einfachen Integrale,” Sdchsischer BerichtCt LVII (1905), 49, and Mathematischc 
Annalen, LXII (1906), 325. See also Duren, op, cit, 
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obtained quite simply by an application of the results of Section 19. Finally, 
in Section 24, a sufficiency proof for a weak relative minimum is given which de- 
pends upon the theory of the second variation but which, when properly 
phrased, does not depend upon the notion of a field. The first proof of this sort 
was also given by Weierstrass.* 

15. Auxiliary theorems. In order to develop the sufficiency theorems of 
Weierstrass, it is necessary first to study some auxiliary properties of two- 
parameter families of extremals. Let 

(IS l) y = y(«, a, /3), a, fi) 

be such a family, containing a particular extremal arc £12 for values xi^ x ^ X 2 , 
ao, 00 and having the properties described in Theorem 7 • 2. Such a family is 
said to simply cover a region F of xyz-spsLce for values (x, a, 0) satisfying condi- 
tions of the form 

(15-2) X ^ X 2 +€ , |a — ao|^6, \0 — 0o\^€ 

if through each point (jc, y, z) of F there passes one and but one of the ex- 
tremals; or, in other words, if for each point (Xy y, z) in F the equations (IS * 1) 
have one and only one solution (Xy y, z, a, 0) satisfying (15*2). The values 
a, 0 belonging to these solutions define two single-valued functions a{Xy y, z), 
0(Xy yi z) in the region F. The functions 

pix, y, z)=y,[*, a{x, y, z), y, z) ] , 

(15-3) 

?(*, y, z) =z,[*, a{x, y, z), /3 (*, y, z) ] 

are called the slope functions of the family in F. If the determinant 

ACic, a, 0) -yaZfi — yoZa 

of the family (15*1) is different from zero in the neighborhood (15*2) of the 
values (Xy a, 0) on £ 12 , then well-known theorems concerning implicit functions* 
tell us that the functions a{xy y, z), 0{xy y, z) have continuous partial derivatives 
of at least the second order, since the second members of equations (IS 1) 
have this property. It follows at once that the slope functions Py q also have 
such derivatives. 

LEBfMA 15 • 1. // for a family of extremals {15 * 1) containing a particular arc E 12 
the determinant A(x, a, 0) is different from zero along E 12 , then there is a region 
{15 '2) of points (x, a, 0) and a neighborhood F of E 12 in xyz-space such that F is 
simply covered by the extremals {15 ■ 1) for values (x, a, 0) in {15 • 2) in the manner 
described above andy further y such that in F the slope functions p(x, y, z), q(x, y, z) 
of the family y as well as the functions a(x, y, z) and 0{xy y, z), have continuous 
partial derivatives of the second order. 

To justify the lemma, we shall first show that there exists a region (IS *2) 


* Werke, Vol, VII, chap, xviii. 


^ See the references in Sec. 7, above. 
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such that no point (x, z) belongs to two distinct solutions (x, y, z, a, 0) and 
(*, y, z, o', 0') of equations (IS - 1) with {x, a, 0) and (*,0', 0) satisfying (15 -2). 
With the help of Taylor’s formula with integral form of remainder term we see 
that 

. .V a. /3) -y(.x, o', 0') = (o-o') Ai+ (8-0) At , 

(lS-4) 

z(x, a, 0)-z(x, o', ^') = (a-a') Bi+(8-0')Bt 

where 

^1= Pyalx, a' + eu-a'), + 

•'0 

and A 2 , Bi, B 2 have similar expressions. Since the determinant 
D(x, a, 0, a', /3') = A 1 B 2 - A 2 B 1 

reduces to A(Xy a, 0) when (a', 0') = (a, 0), it is different from zero along £12 
and hence is different from zero in the region (15 *2) when e is sufficiently small. 
It follows at once from equations (15-4) that no two distinct solutions of equa- 
tions (15*1) satisfying (15*2) have the same projection (x, y, z). 

The second step in the proof of the lemma is to show that there exists a 
neighborhood F of the arc £12 in xyz-spsice such that every point (Xy y, z) in F 
belongs to a solution (x, y, z, a, 0) of equations (15 *1) satisfying (15*2). If this 
were not so, there would exist a sequence of neighborhoods Fn (» = 1,2 ,...) 
condensing on the arc £12, each containing a point (x, y, z)„ belonging to no 
solution. The points (ju, y, z)n would have an accumulation point (f, rjy f) neces- 
sarily on £12. But from the usual existence theorems concerning implicit func- 
tions, applied at the initial solution ({, 17, f, ao, 0o) of equations (15 *1), it would 
follow that to every point {x, y, z) in a sufficiently small neighborhood of 
(f> Vi f) there would correspond a solution satisfying (15 *2). This is a contra- 
diction. 

The solutions (x, y, z, a, 0) of equations (15 *1) satisfying (15 *2) and corre- 
sponding to points (Xy y, z) in F define two functions a{xy y, z), 0{Xj y, z) which 
have continuous partial derivatives of the second order in £. This can be seen 
by applying the implicit function theorem at each such solution (Xy y, z, a, 0), 
since the second members of equations (15* 1) have continuous second deriva- 
tives. It is easy to see, then, that the slope functions p{xy y, z), q{xy y, z) of the 
family (15 * 1) also have such derivatives in £, as stated in the lemma. 

Lemma 15 *2. If a non-singular extremal arc E12 has on it no point conjugate 
to its initial point 1, then on the extension of E12 there is a point 0 in the order 012 
such that the extremals through 0 form a two-parameter family {15 -I) with de- 
terminant A(x, a, 0) different from zero along E 12 . 

This is an immediate consequence of Theorems 12*2 and 7*2 and Corollary 

12 - 2 . 

16. The sufficiency theorems of Weierstrass. In the preceding pages the 
Roman numerals I, II, III, and IV designate the necessary conditions for a 
minimum deduced in Sections 6, 9. 10, and 12 of the preceding chapter. The 
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statements of the sufficiency theorems in this and later sections are greatly 
simplified by the introduction of further notations due to Bolza. The symbols 
II' and III' are used to denote the necessary conditions of Weierstrass and 
Legendre with the equality signs excluded in the statements of II and III 
in Section 9. Similarly, IV' is Jacobi's condition IV of Section 10 strengthened 
to exclude points 6 conjugate to 1 from the end-point 2 of an extremal arc En 
as well as from the interior of the arc. It is understood that C 12 is an arc with 
the points 1 and 2 as end-points and that /(C12) is the value of the integral I 
taken along this arc. 

If an arc £12 gives I a minimum value relative to the class of admissible 
arcs C12 in a sufficiently small neighborhood of the elements {x, y, z, y', 2 ') 
on £12, then 1 (£12) is said to be a weak relative minimum. A minimum provided 
by £12 relative to the class of admissible arcs C12, restricted only to have their 
points (x, y, 2 ) in a sufficiently small neighborhood F of £12 in xyz-space, is 



called a strong relative minimum. The first theorem to be proved is, then, the 
following one: 

Theorem 16* 1. Sufficient Conditions for a Weak Relative Minimum. 
If an admissible arc E 12 without corners satisfies the conditions 7, ///', /F', then 
there exists a neighborhood Ri of the values (x, y, z, y', z') belonging to E 12 smh 
that the inequality I(Ci2) > I(Ei2) holds for every admissible arc C12 in Ri and 
not identical with E12. 

To prove the theorem, one should first note that the condition III' implies 
that the determinant is different from zero along £ 12 . Other- 
wise the quadratic form (9-5) could not be positive definite at each element 
of £12, as III' requires. Since £12 satisfies condition I, it follows from the proof 
of Hilbert's differentiability condition in Corollary 6*3 that £12 has continuous 
second derivatives y", 2 " and satisfies the Euler equations differentiated out 
in the form (7 *1). Hence £12 is a non-singular extremal arc. 

Lemmas IS * 2 and 15 * 1 now tell us that there is a point 0 on the extension of 
£12 to the left of 1 such that the two-parameter family (15*1) of extremals 
through 0 simply covers a neighborhoend F of £12 in a^yz-space, as shown in 
Figure 16*1. For every admissible arc C 12 in F the one-parameter family of 
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extremals through the jMint 0 and a movable point J on Cn is a family to which 
Corollary 8 ■ 1 applies. The equation analogous to (8-6) for this family gives 

/ (£ 12 ) = / (^^ 2 ) - / (^ii) = / * (Ci 2 ) ; 


and, consequently, 


(161) 


/ (C 12 ) - I (£, 2 ) = / (C, 2 ) - I* (C, 2 ) 

= / ■£[*. y. z. ^(z:, y, 2), q(x, y, 2), y', 2'] dx , 


where the variables z in the integrand function E are to be replaced by the 
functions y{x), z(x) defining the arc C 12 and where p{x, y, z), q{x, y, z) are the 
slope functions of the family of extremals simply covering F. 

The conclusion of Theorem 16 • 1 now follows without difficulty when the 
neighborhood Ri is chosen so small that all of its elements (jc, y, z, y', z') and 
the associated elements 

[x, y, 2, P (x, y, z), q {x, y, z) ] 

are in the neighborhood N of Corollary 9 1. For then the integrand function E 
in equation (16*1) is positive unless 

(16-2) y' = pi.x, y, 2), s' = 9(x, y, 2), 

and the difference 1 (C 12 ) — /(£i 2 ) is positive unless these equations are satis- 
fied at every point of C 12 . The differential equations (16-2), however, have 
one and only one solution through the point /, and that solution is £ 12 . 

The following sufficiency theorem for a strong relative minimum involves a 
still further extension of the condition II of Weierstrass. An arc £12 is said to 
satisfy the condition Hat if there is a neighborhood N of the elements {x, y, z, 
y', z') on £12 such that the condition 

(16-3) £(x, y, z, y', z', F', Z') ^ 0 

holds for all sets {x, y, z, y', z', K', Z') with (x, y, z, y', z') admissible and in N and 
with {Xy y, z, F', Z') admissible and having (F', Z') ^ (y', z'). The condition II^ 
is this condition with the equality excluded in (16*3). 

Lemma 16*1. If an admissible arc E is non-singular and satisfies the cofidition 
//n of Weierstrass, then it also satisfies the strengthened condition N is 

properly restricted} 

To prove that the condition II^ holds, suppose that there were a set 
2 , y\ Y'j ^') with {x, y, z, y', z') in N and with {x, y, z, F', Z') 

(jc, y, z, y', z') admissible and such that the equality in (16 *3) holds. Then the 

* This lemma is due to Hestenes for non-parametric problems and to Hestenes and Reid 
for problems with differential equations as side conditions. See Hestenes and Reid, “A Note 
on the Weierstrass Condition in the Calculus of Variations,” Bulletin of the American Mathe- 
matical Society, XLV (1939), 471-73. 
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£-function with fixed x, y, z, Y\ Z' would have a minimum at {y\ z')> and at 
this point its derivatives 

h' - F') + ( z' - Z') /, V , (y' - Y') + ( 2' - Z') 

with respect to y', z' would have to vanish. This is evidently impossible if 'N is 
chosen so small that the determinant (6*3) remains different from zero in it. 

With the help of the lemma we can now prove the following theorem: 

Theorem 16 *2. Sufficient Conditions for a Strong Relative Minimum. 
If an admissible arc E 12 without corners is non-singular and satisfies the conditions 
/, //n, IV\ then there is a neighborhood F of E 12 in xyz-space such that the relation 
I(Ci 2 ) > I(Ei 2 ) holds for every admissible arc C 12 in F not identical with E 12 . 

The proof of this theorem is like that of the preceding theorem down to the 
last paragraph. The neighborhood F of £12 can be taken so small that all of the 
sets 

[*, y, z, pix, y, z), q(x, y, a)] 

belonging to F are in the neighborhood N for which the condition Il^r holds. 
The difference /(C 12 ) — /(£i 2 ) in formula (16 1) is then evidently positive or 
zero. Since by the last lemma the condition IIJl^ also holds when N is sufficiently 
small, this difference is surely positive unless the differential equations (16 *2) 
hold along C 12 , in which case C 12 is identical with £ 12 , as before. 

Evidently the conditions I, IlJ^r, HI', IV' also insure a strong relative mini- 
mum, since III' implies the non-singularity of £12 and since the remaining 
hypotheses of Theorem 16 *2 are immediate consequences of I, Iljlr, IV'. These 
are the conditions which have heretofore usually been given. 

In special cases the region R may have the property that when two elements 
{x, y, z, y[, z(), {xj y, z, y^, zj) belong to it so do all the elements {x, y, z, y', z') 
with yj ^ y' ^ y^, z[^ z' ^ Zj. In this case the region is said to be convex in 
the variables y' and z', and the following corollary to the last theorem often 
provides the simplest criterion for a minimum. The notation Illy designates 
the property that the inequality 

(16*4) fy'y^ + 2 A',' f + A'/ f 2 ^ 0 

holds for all admissible elements ( op , y, z, y', z') with projections {Xy y, z) in a 
neighborhood F of the arc £12 and for all pairs ri, f such that = 1. The 

notation IIIf is used for this property with the equality sign in (16 *4) excluded. 

Corollary 16 *1. If the region R has the convexity property just described, 
then an admissible arc E 12 without corners and satisfying the conditions /, ///p, 
IV\ will make I(Ei 2 ) a strong relative minimum, as described in the last theorem. 

It is evident, with the help of the relation (9*3), that the condition 11m 
is a consequence of Illj^, and hence the truth of the corollary follows at once 
from Theorem 16*2. The additional property of the region R is presupposed 
for the corollary in order that the relation (9 *3) may be applicable to all pairs 
of elements (x, y, z, y', z'), {x, y, z, F', Z') belonging to points {x, y, z) sufficiently 
near to £ 12 . 
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17. A comparison of necessary with sufficient conditions. With the help 
of the notations introduced in the last section it is possible to construct a table 
of necessary and of sufficient conditions which indicates clearly the gaps be- 
tween the two which remain to be closed. The table applies to admissible 
arcs without comers. 

Tabix at Necessary and of Sufficient Conditions 


T3rpe of Minimum Necessaiy Conditions Sufficient Conditions 

Weak relative I, III, IV I, III', IV' 

Strong relative I, II, III, IV I, IIjv, IV', E non-singular 

Strong relative I, IIj;^, III', IV' 

Strong relative I, Illf-, IV' 


If corners are permitted, the condition IV of Jacobi must be modified. Such 
a modification was originally deduced for parametric problems in the plane by 
Caratheodory and was later studied by Bolza and Dresden for parametric 
problems in the plane.® The minimizing properties of broken extremals in higher 
spaces have been discussed by Graves, Reid, and Smiley.*^ For many special 
problems, however, the Weierstrass-Erdmann corner conditions cannot be 
satisfied, so that minimizing arcs with corners are impossible. The table above 
is then applicable to all minimizing admissible arcs. 

According to its definition, an admissible arc has all of its elements (a;, y, 
y', z') interior to the region R, It is possible that there may be minimizing arcs 
some of whose elements are on the boundary of R. The theory of such mini- 
mizing arcs has been studied for some special types of problems, and the 
methods used can be applied to problems in three-space of the type under 
consideration here.® 

18. The definition and first properties of a field. In the proofs of the suffi- 
ciency theorems of Weierstrass in Section 16 the region F and its slope functions 
pix, y, z), q{x, y, z) played an important role. It can readily be seen that the 
Hilbert integral 

(181) /*=/ [fdx+(dy-y'dx)/y'+{dz-z'dx)f.-], 

• Caratheodory, “ttber die diskontinuirlichen LSsungen in der Variationsrechnung” (diss., 
Gdttingen, 1904), and Mathematische AnwUen, LXII (1906), 474; Bolza, Vorlesungen Uber 
Variationsrechnung, chap, viii; Dresden, “The Second Derivatives of the Extremal Integral,’ - 
Transactions of the American Mathematical Society, IX (1908), 480. 

^ Graves, “Discontinuous Solutions in Space Problems of the Calculus of Variations,” 
American Journal of Mathematics, LII (1930), 1-28; Reid, “Discontinuous Solutions in the 
Non-parametric Problem of Mayer in the Calculus of Variations,” American Journal of 
Mathematics, LVII (1935), 69-93; Smiley, “Discontinuous Solutions for the Problem of Bolza 
in Parametric Form” (diss., University of Chicago), Contributions 1933-1937, pp. 527-66. 

• See Bolza, op. cit., pp. 392 ff., where references are given to Weierstrass, Bolza, Bliss, 
and others. Sec also Bliss and Underhill, “The Minimum of a Definite Integral for Unilateral 
Variations in Space,” Transactions of the American Mathematical Society, XV (1914), 291-310. 
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with the arguments y' and z* replaced by the slope functions p and 9 , has the same 
value on all arcs Dn in F having suitable continuity properties and the same 
end-points 3 and 4. The family of extremals joining the point 0 to a movable 
point on such an arc is, in fact, a family to which Corollary 8 * 1 is applicable. 
For this family, equation (8 * 6 ) takes the form 

HE,,) -KEoz) =/*(Z>84); 

and since the first member is completely determined when the end-points 3 and 4 
are specified, the integral must have the same property. This result 

shows that the region F is a field in accordance with the following more general 
definition: 

Definition of a Field. A field is a region F of ruys-space with a pair of 
slope functions p{x, y, 2 ), q(x, y, z) having the following properties: 

a) They are single valued and have continuous first partial derivatives in F; 

b) The elements 

(18-2) [*, y, z, p{x, y, z) , q{x, y, z) ] 

defined by points {x, y, z) in F are all admissible; 

c) The Hilbert integral (18 *1) is independent of the path in F in the 
sense described above. 

The integral /* has the form 

(18-3) (Adx+Bdy+Cdz) , 


where A, B^C are functions of x, y, z with the values 
(18-4) ^ = 5=/,s C=// 


in which the arguments of / and its derivatives are those of the set (18*2). A 
simple computation shows that 


(18-5) 



Pdfy^ 

dy 


99fy' 

dz 


fy q GB» Cy) f 


Cy-Ay 



pdf,' 

dy 


qdf.' 

dz 


f, + p(JB,-Cy), 


where the partial derivatives indicated are to be taken with respect to the 
variables x, y, z occurring explicitly in / and its derivatives as well as in the 
slope functions p(x, y, z), q(x, y, z). 

It is well known that the equations 

(18-6) — C,«=0, Cx—Ag^O, Ay—Bg’^O 

are necessary conditions for the integral I* to be independent of the path. 
These conditions can be deduced readily here, however, by noticing that every 
arc in the field must be a minimizing arc for I* when I* is independent of the 
path. The Euler equations for the integrand f=A + By + C«' of I* must 
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therefore be identities in x, y, z, y\ s'; and the equations (18-6) are equivalent 
to these identities, as is readily provable. 

The solutions of the differential equations 

in the field F are all extremals, since along such a solution 

f _ x P^fv I f 

dx dx^ dy dz 

(18-8) 

dx dx^ dy dz 

where the partial derivatives are like those in equations (18 -5). The expres- 
sions (18 *8) evidently vanish as a consequence of equations (18 *5) and (18*6). 

The equations (18 *7) are called the differential equations of the fields and 
their solutions are the extremals of the field. Through every point of F there passes 
one and but one such extremal, so that the totality of extremals of the field F^ 
each determined by its intersection with a fixed surface in F, can be regarded as 
a two-parameter family. 

An important property of an extremal arc of a field is that the values 
/*(£i 2 ) and /(£i 2 ) are the same, since the integrand of /* in equation (18*1) 
reduces to / along £12 as a consequence of the equations (18 *7). 

19. A fundamental sufficiency theorem. The Hilbert integral associated with 
a field has the two following fundamental properties, described in the preceding 
paragraphs: (1) it is independent of the path in the field, and (2) it has the 
same value as the original integral I on an extremal arc £12 of the field. These 
properties enable one to prove the following theorem: 

Theorem 19*1. The Integral Formula of Weierstrass. If E 12 is an 
extremal arc of a field F with the slope functions p(x, y, z), q(x, y, z), then for 
every admissible arc C 12 in the field joining the end-points 1 afid 2 of E 12 the formula 

(19*1) /(C 12 ) -/(£i 2 ) = pE^x, y, z, p, q, y\ 'z')dx 

holds, where the variables y, z occurring in the integrand function are to be re- 
placed by the functions y(x), z(x) belonging to C 12 . 

The proof is simple, since, on account of the properties of the integral /*, 

/(£i2) = /♦(£i2) =/*(Ci2)- 

and consequently 

/ (C 12 ) - / (F 12 ) = I (C 12 ) - /* (C 12 ) . 

This becomes the formula (19*1) of the theorem when the symbols /(C 12 ), 
/*(Ci 2 ) are replaced by the integrals which they represent. 
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The formula (19 ■ 1) is sometimes called the “integral formula of Weierstrass.” 
It was proved in Section 16 for the special field whose extremals all pass through 
a fixed point 0, and it is now seen to hold also for the more general fields de- 
fined in Section 18. With its help the following important theorem can be 
proved: 

Theorem 19-2. The Fundamental Sufficiency Theorem. If Em is an 
extremal arc of afield F and if at each point of the field the condition 

(19-2) E{x, y, z, p{x, y, z) , q(x, y, z) , y', s'] ^ 0 

holds for every admissible set (x, y, z, y', z') with (y', z') ^ (p, q), then the rela- 
Hon l(Ci2) ^ I(Ei 2) is true for every admissible arc C12 in the field which joins 
the end-points 1 and 2 of E12. If the condition {19 2) holds without the equality 
sign, then I(Ci2) > I(Ei2) unless C12 is identical with E12. 

The conclusion of the theorem is evident with the help of the formula (19-1) 
of Weierstrass. If the condition (19-2) holds without the equality sign, then 
/(C12) > /(E12) except when the equations y' = «'.= 9 are satisfied at every 

point of C12. But in that case the arc C12 would have to coincide with £12, since 
the equations y = p, q have one and only one solution through the point 
i, and that solution is £12 itself. 

The sufficiency theorem which has just been proved is frequently a very 
useful one in the study of special problems. If an extremal arc £12 can be im- 
bedded in a field in which the £-function has the property presupposed in the 
theorem, then its minimizing properties are assured — at least within the field — 
and no further tests need be applied. In the following section several methods 
for the construction of such fields are described. 

20. Methods of constructing fields. It has been seen in Section 18 that every 
field F is simply covered by extremals which may be regarded as forming a two- 
parameter family. It is the purpose of this section to determine the conditions 
under which a region F, simply covered by a two-parameter family of ex- 
tremals, will form a field with the slope functions p{Xy y, z), q{Xf y, z) of the family. 

The family of extremals to be considered is supposed to be defined by 
functions of the form 

(20*1) y(«, a, jS), z{x, a, jS) , 

which with their derivatives y*, z» have continuous partial derivatives of at 
least the second order for all sets {Xy a, /3) satisf}dng conditions of the form 

(20-2) (a, jS) interior to il , «i(a, l9 )<»<« 2(a, /J). 

In these conditions is a region in the ajS-plane and Xi{ay j8), a^(a, fi) are sup- 
posed to be single valued, continuous, and distinct functions in that region. 
Such a family simply covers a region F of :x^s-space if, with the region (20*2) 
in place of (15*2), it satisfies the conditions described in Section 15 for the 
family (15*1) and the region (15*2). 
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If a function {(a, ff) is single valued, continuous, and has continuous par- 
tial derivatives of at least the first order in the region A, then the functions 

(20.3) ^ ^ P) (cl, P), a, fi] , 

f (a, jS) = z [{ (a, |8) , a, j3] 

define a surface 5. Such a surface intersects each of the extremals (20 1) in 
the point defined by rr = {. The following theorem now gives a first characteri- 
zation of the two-parameter families which simply cover fields: 

Theorem 20- 1. // a two-parameter family of extremals (20 -1) is cut by a sur^ 
face S of the form (20 -S) and if on S the integral 1*^ formed with the slope functions 
yx((y a, fi), Zx({, a, /3) of the intersecting extremals^ is independent of the pathy then 
every region F of xyz-space which is simply covered by the extremals is afield with 
the slope functions of the family, provided that the determinant A(x, a, ff) of the 
family is different from zero at each set of values x, a, corresponding to a point 
in F. 

The proof of this theorem is simple with the help of Corollary 8 1. Every 
arc D 46 in the region F determines a one-parameter family of the extremals 
(20* 1) intersecting the surface S in an arc Csb- The formula (8 *6) of Corollary 
8*1 applied to this one-parameter family gives the equation 

(20-4) I (£ 55 ) " / (£ 34 ) = /* (Z>46) - /* (C 36 ) , 

where the first Hilbert integral I*(D^ is formed with the slope functions of 
the extremals in the region F and the second /*(C 35 ) is formed with the slope 
functions of the extremals of the family at their intersections with the surface 5. 
Since the integral I*(CBb) so formed is, by hypothesis, independent of the path 
on 5, it follows that the values of the first, second, and last terms in the equa- 
tion (20 *4) are uniquely determined when the end-points 4 and 6 of the arc 
Dib are given; and /*(D 46 ) must therefore itself be independent of the form of the 
path D 46 in F. Under these circumstances the region F is a field with the slope 
functions of the family (20* 1), as required by the definition of a field in Sec- 
tion 18. 

The property of invariance for I* on 5, required in the theorem for a two- 
parameter family of extremals covering a field, is a natural one, since the Hil- 
bert integral is independent of the path in every field and therefore also on 
every surface S which cuts across the extremals of the field. It is not necessary 
for the truth of the theorem, however, that the surface S shall lie within the 
region F simply covered by the extremals. In special cases it may degenerate 
into a fixed point or a fixed curve. 

Theorem 20 1 provides a method of constructing two-parameter families of 
extremals which will form fields in every region which they simply cover. Let S 
be a surface defined by functions of the form 

(20-5) f(a, /3), ij(a, /3) , 


(a, jS) [ (a, /3)in A ] , 
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and let ij'(a, p), f'(a, 0) be two functions such that the elements 

(20-6) [{(a, jS), i?(a, 5), f (a, /3) , i,' (a, /3) , f ' (a, 0)] 

are all admissible. It is presupposed that all of the five functions within the 
brackets have continuous partial derivatives of the third order in the region A. 
The Hilbert integral (18* 1) with the slope functions j8), f'(a, 0), taken 
along arcs on 5, has the form 

f iPda+QdP), 

where 

P ~ (Va fy* “f* (fa ~ f /*' j 

<?=/{/*+ iV0-v'i0)fy^+ (f/J-f'W fz' , 

and the arguments of / and its derivatives are the functions (20 *6). If there is a 
function W(a, 0) with continuous partial derivatives of the third order in A, 
and such that P = Wat Q — then it is evident that the Hilbert integral 
just constructed is the integral of dW and is independent of the path on S. 
According to Theorem 20- 1, the two-parameter family of extremals with the 
initial elements (20-6) will then form a field in every region F which it simply 
covers. 

If the surface (20 S) and the function pr(o, 0) are arbitrarily selected in 
advance in such a way that the surface is non-singular and the functions f , tj, f , 
W have continuous partial derivatives of the third order, then a set of elements 
(20*6) related to them as described in the last paragraph, can be determined 
by solving for 77' and f ', as functions of a and 0^ the equations 

, ^ /fa+(i;a-17'fa)//+(f.-f'f.) ^ = PV. , 

(20-7) 

/&+ (17/^- >?'&) /y' + (r^- r' w , 

in which the arguments of / and its derivatives are t(o, 0), 77(0, 0), f (a, 0), and 
the variables 17', f'. Implicit function theorems assure us that solutions i7'(a, jS), 
f'(a, 0) exist, provided that equations (20 7) have an initial solution (ao, 0ot 
Vo 9 fo) with a direction 17^, fo tangent to S at the point defined by the 
values (tto, jSo), and such that the element 

[((ao) 0q) t 77 (ao, 0o) 9 f (aoi ^0) » ’7o> fo] 

is admissible and makes the determinant /y'y/*'#' — (/y'*')* different from zero. 
For the functional determinant of the first members of equations (20*7) with 
respect to the variables 77', f ' is the product 


17. 

r.-rf. 



ri$— 
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and this is different from zero when the direction determined by ijo, f i in xyz- 
space is not tangent to 5 at the point defined by ao, /3o, since the condition for 
non-tangency is the non-vanishing of the determinant 


1 









•na 





na-v ia 

ta-i'ia 



ta 



at the values (ao, /3o, i?o, f o) • 

An important special case of a family with the properties described in 
Theorem 20* 1 is a two-parameter family of extremals passing through a fixed 
point 0. In this case the point is a degenerate surface S on which the Hilbert 
integral is certainly independent of the path. In the proofs of the sufficiency 
theorems of Weierstrass in Section 16 use was made of a field simply covered 
by such a family. 

It is frequently convenient to choose the function W{a, p) equal to a con- 
stant for the equations (20-7), in which case the second members of these 
equations are zero. Every arc a = a(/), P = P{t) on the surface S then cuts 
transversally the extremal arcs with initial elements (20 -6) by which it is inter- 
sected, according to the definition of transversality in Section 10. In this case 
the surface 5 is said to be a transversal surface of the family of extremals. 

When the surface 5 is a plane x = Jico, its defining functions (20 -5) take the 
form xqj a, P, and equations (20 -7) reduce to the simpler pair 

(20-8) a, 0, i;', f') =1^., (o^o, a, jS, V, V) 

If one solution (ao, /3o, nL fo) these equations is known, at which the determi- 
nant — (/y'*0* IS different from zero, then the implicit function theo- 

rems are applicable and assure us of the existence of other neighboring solu- 
tions r)'(o, ft), f'(a, ft). 

21. Sufficient conditions for an integral to be independent of the path. The 
integral to be considered in this section has the form 

( 211 ) f (A dx+Bdy+C dz), 

where A, B,C are functions of x, y, z, which are single valued and have con- 
tinuous partial derivatives of at least the first order in a region F of xyz-space. 
An arc 

( 21 - 2 ) x(t), yit), zit) 

is said to be regular if the functions x(t), y(t), z{t) defining it are continuous 
and have continuous derivatives such that x'^ •+• y'* -f- z'* 0 everywhere on 

the arc. A transformation of the parameter / is defined by a function / =* ^(r) 
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having a continuous derivative ^'(r) > 0 on an interval rir 2 whose ends ti 
and 72 are such that /i ^(n), h » ^( 72 }. It is easy to see that the value of 
the integral (21*1) is unchanged when the parametric representation is trans- 
formed in this way. 

A regular arc Ci in F is deformable into a second such arc C 2 with the same 
end-points if the two arcs are members for a = fli and a = 02 , respectively, 
of a one-parameter family of arcs in F through the same two end-points and 
defined by functions of the form 

(21*3) x{t, a), y{ty a), z(/, a) [ai^a^a 2 j fi(a) ^/g /2 (a) ] . 

These functions and their derivatives Xt, yi, Zt and the functions / 2 (a) 
are supposed to be single valued and to have continuous first partial deriva- 
tives with respect to a for all values /, a satisfying the conditions in the parenthe- 
sis. The region F is said to be simply connected if every regular arc C\ in F is 
deformable into every other such arc with the same end-points by a sequence of 
such deformations. 

Theorem 211. If the identities 

(21*4) Bft — Cy = 0 , Cx"^Ag^0f i4y““5x^0 

hold in the region F for an integral (21-1) with the continuity properties described 
above and if F is simply connected according to the definition just given, then the 
integral is independent of the path in F. 

The first step in the proof of this theorem is to show that on all arcs of a 
family of the type (21*3) the values of the integral (21*1) are the same. The 
derivative with respect to a of the integrand of (21 1), when the functions 
(21*3) are substituted, is readily calculated, with the help of the identities 
(21*4), to be 

AXa’i‘Bya’\-CZa 

^ Ft ' 

Hence, the derivative with respect to a of the value of the integral itself is 

[ A ix% + *a) +B {yX + ya)+C{ zX + Za) ] [[ ] 

and this is zero, since the arcs of the family all pass through the same end-points 
for t ** ti(a) and t « fe(o), and x'ta + Xa and the similar expressions for y and z 
therefore vanish at these values. 

Since the integral (21 * 1) has the same value on all arcs of a family of the 
form (21*3), it follows readily, since F is simply connected, that it has the 
same value on all regular arcs in F which have their end-points in common. 
Consequently, if a fixed point 1 is chosen in F, the value of the integral (21*1) 
on regular arcs joining 7 to a second point (x, y, z) in F is a single-valued func- 
tion W(x, y, z). By integrating up to {x, y, z) on regular arcs starting at the 
point 1 and ending with straight end-s^^ents parallel to the axes near (x, y, z), 
it is readily provable that the three first partial derivatives of W are A, B, C. 



SUFFICffiNT CONDITIONS FOR A MINIMUM SI 

Hence the integral (21 • 1) is the integral of the differential dW and is independ- 
ent of the path, not only for regular arcs, but also for all arcs in F of the type 
(21 • 2) which are continuous and which furthermore consist of a finite number 
of regular arcs. 

It is interesting, though not essential for the sequel, to note that the only 
integrals of the form 

I=ff(x, y, z, y' z')dx 

which can be independent of the path are those of the type (211) with co- 
efficients B^C satisfying the conditions (21*4). For if I is independent of 
the path, every admissible arc must be a minimizing arc, and Euler’s equa- 
tions (7 • 1) must be satisfied identically in the variables y, z, y', z', y", z". 
The coefficients of y", z" in these equations are the derivatives /y'ys/y'tS 
Their vanishing implies that / is linear in y', z', and the relations (21*4) are 
then easily deducible as further consequences of the identical vanishing of 
the first members of Euler’s equations. 

22. Further properties of the slope functions and extremals of a field. The 
methods which have been described in Section 20 are usually the most effective 
ones for the construction of fields for special problems. It is interesting, how- 
ever, and for theoretical purposes sometimes important, to have the further 
criteria for fields which will be developed in this section. 

The slope functions p[x^ y, z), q{x^ y, z) are an essential part of a field, and 
one should seek to find the equations which characterize them independently 
of the extremals of the field. The desired equations are deducible from equations 
(18 *6) formed for the functions (18 *4). With the help of the identities (18 *5) it 
is evident that these three equations are equivalent to -B, — Cy = 0 and the 
two further equations found by setting the expressions (18 *8) equal to zero. 
After some manipulations these three equations take the form 

/y fyx ““ fvvp ” fy zQ "" Jyy "1" PyP ’^PzQ) 

- fy'z^(qz + qyp + qzq) =0 

(22* 1) /, fg*x fz'yP f***q /*'»' ^px PyP P^q) 

- iqz + qyp + q^q) = 0 

-B* Cy = fyg 4“ fy'y'Pa "H fyg* /»'y fzyPy "" “ 0 

in which the* arguments of the derivatives of / are ac, y, z, py q. 

Theorem 22*1. The slope functions p(x, y, z), q(x, y, z) of every field F satisfy 
the three equations {22' 1), Conversely ^ every simply connected region F of xyz- 
space is a field with the slope functions p(x, y, z), q(x, y, z), provided that in F 
these functions are single valued, have continuous first derivatives, define elements 
{x, y, z, p, q) which are admissible, and satisfy equations {22' 1). 
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The last part of the theorem is an immediate consequence of Theorem 21*1, 
since equations (22 1) imply the identities (21*4) for the coefficients Ay B^C 
of the Hilbert integral given in equations (18 *4). 

The value of the integral /, taken along the extremals of a family (20*1) 
between fixed limits x\ and x^y is a function /(a, jS) of the parameters of the 
family. If jS is kept fixed while a varies, the resulting one-parameter family of 
extremals is of the type to which Theorem 8 • 1 is applicable, and the formula 
(8 *4) of that theorem shows that the partial derivative of /(a, fi) with respect 
to a is 



A similar formula holds for the derivative with respect to If the two second 
derivatives Ia»y I$o. are now calculated and equated, it is found that 


( 22 * 2 ) 


ray a , . 3z a ,,_ay a az a 




dy 


dz 

dp 


When xi is kept fixed and X 2 is allowed to vary, it follows that on every extremal 
arc of a family (20*1) the expression between the brackets is constant. 

If the extremals of a family (20*1) simply cover a field F, the integral I* of 
the field must be independent of the path on the section of F by a fixed plane 
X = Xiy and also in the region of points (a, 0) which correspond to this section 
by means of the equations y = y{xiy a, /3), z = z{xij a, 0), In this a/3-region the 
integral 7* in (18 *1) is readily seen to have the form 




dy 

Jb' 


y + A' 0^) » 


where the arguments of the derivatives of / are the functions (20*1) and their 
derivatives with respect to x evaluated at x = xi. In order that this integral 
in the a/3-region shall be independent of the path, it is necessary that 



and this result shows that along the extremals of the field the expression be- 
tween brackets in (22*2) is not only constant but vanishes identically. The 
following theorem can now be readily established: 

Theorem 22*2. Along each extremal of every two-parameter family of ex- 
tremals of the form (20 *7) the expression 


(22*3) 


^ A / ,-lA A A f . 

da dP da dP dp da dP da 


remains constant. If the family simply cavers a field F, then the constant values of 
this expression on the extremals of the field are all zero. Conversely y if the values of 
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{22 *3) on the extremals of a family {20'1) are all zero, then every simply connected 
region F simply covered by the extremals and having the determinant A =yaZ^ — 
YfiZa of the family different from zero in it, is a field with the slope functions 
p(x, y, z), q(x, y, z) of the family. 

Only the last sentence of the theorem remains to be proved. The slope 
functions p, q of the family satisfy the first two of equations (22 - 1), since they 
are the slope functions of a family of extremals satisfying equations (18-7). 
To show that they also satisfy the third equation (22 • 1), one can first find the 
derivatives with respect to y and z of the solutions a{x, y, z), p{x, y, z) of equa- 
tions (20* 1) by solving the equations 

1 = yatty + yefiy , 0 = yatt* + yep , , 

0 ■“ ZaOty ^ePy y 1 ~ ZaOig ZfiP, • 

The result is 

(22*4) Atty = Zft , A^y = Za f Aa, = ye , A^x ~ ya > 

where A is, as before, the determinant yaZe y$Za- The coefficients B = /y', 

C = in the integral I* have the arguments x, y, z, p, q, which are, however, 
the same as the functions 

X, y{x, a, P), z{x, a, P) , y'(x, a, jS) , z' {x, a, P) 

with a, P replaced by the functions a{x, y, z), p{x, y, z). With the help of equa- 
tions (22 • 4) the expression B, — Cy is then found to have the value 


B -c / f.-^± f. 

‘ ■' dz ^ dz ay do-'* dy dp 


__ 1 / dy d e ^ X, ^ ^ /A 

which vanishes identically by hypothesis. Hence the result desired in the 
last theorem is a consequence of the theorem which precedes it. 

A two-parameter family of extremals (20*1) whose slope functions form a 
field in every region F which the family simply covers with determinant 
A{x, a, P) different from zero is sometimes called a Mayer family of extremals, 
after the man who first studied the theory of fields in higher spaces.* Such 
a family is a generalization of a normal congruence of straight lines, a two- 
parameter family of straight lines which is cut orthogonally by a one-parameter 
family of surfaces. It is well known that not every two-parameter family of 
straight lines in space has this property but that, if the lines of the family are 
all intersected by a single orthogonal surface, then there is a one-parameter 
family of such orthogonal surfaces and the family is a normal congruence. 
Similarly we know, from the results of Theorem 20*1, that if a two-parameter 
family of extremals is cut transversally by a single surface S, then the family 

* For references see Bolza, Vorlesungen, p. 648. 
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is a Mayer family; and we shall see in Section 28 that there is a one-parameter 
family of such transversal surfaces in every region F which the family simply 
covers. 

Theorems 20 * 1 and 22 - 2 give criteria for a two-parameter family of extremals 
to be a Mayer family. A particular case is a two-parameter family of ex- 
tremals all passing through a single fixed point. 

23. The theory of the second variation. Before the time of Weierstrass, 
writers on the calculus of variations inferred without proof that an arc £12 
surely gave the integral I a minimum value if the first variation f) along 
£12 vanished and the second variation 72 ( 17 , f) was positive for all admissible 
variations 17 , f vanishing at Xi and but not identically zero between these 
values. Their instinctive conclusion was correct for the case of a weak relative 
minimum, provided that £12 also satisfies the strengthened Legendre condition 
III'. For in that case the positiveness of the second variation implies the 
strengthened Jacobi condition IV' along £ 12 , as one sees with the help of the 
reasoning in Section 11; and the hypotheses of Theorem 16 * 1, the sufficiency 
theorem for a weak relative minimum, are therefore all satisfied along £ 12 . 
The proof of this theorem in Section 16 uses the notation of a field, but the first 
proof was made by Weierstrass^® with the aid of an expansion by Taylor's 
formula and without the use of a field. Proofs of this latter type, the so-called 
“expansion proofs," have also been given for a strong relative minimum 
but they are much more complicated than the corresponding proofs for a weak 
relative minimum. 

The impressions of the early writers mentioned above were the cause of the 
development of an elaborate theory of the second variation, the purpose of 
which was to specify the circumstances under which the second variation could 
be transformed into an expression which was surely positive. The literature of 
this theory is extensive and complicated, but many of the methods used can 
be greatly simplified by correlating them with the theory of the accessory mini- 
mum problem of the second variation described in Section 11 of the preceding 
chapter.^* It is to be shown in this section that by this correlation the result of 
the transformation theory of the second variation appears as an immediate 
consequence of the notion of a field and the integral formula (19* 1). The suffi- 

cit.j pp. 173-77. This proof seems to have been first presented by Weierstrass 
about the year 1877. See Duren, op. cit., pp. 245-349. 

n See Levi, **Sui criterii suffidente per il massimo e per il minimo nel calcolo delle variazi- 
oni,” Annali di matemaiica, XXI (1913), 173-218; Malnate, **Siii criterii suffidente per il 
massimo e minimo nel calcolo delle variazioni,” Giornale di maUmatiche di BaUaglini, LVII 
(1919), 79-102; and espedally Reid, **Suffident Conditions by Expansion Methods for the 
Problem of Bolza in the Calculus of Variations," Annals of Malhemalics, XXXVIII (1937), 
662-78. 

^ Bliss, "The Transformation of Clebsch in the Calculus of Variations, of the 

Inkmoiional Mathematical Congress, Toronto^ 1924, 1, 589. 
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ciency proof for a weak relative minimum, of Section 24 below, makes use of 
these results. 

Let £12 be a non-singufar extremal arc for the original integral L The ex- 
tremals of the problem of minimizing the second variation 

/ *% 

2 (a{Xy 17, f, 17S V)dx 


along £12, in the class of admissible variations 17, f joining the points {xij 0, 0) 
and 0, 0) in a[;T7f-space, are solutions of the accessory equations (11*1), 
which may be written with the notations 


(231) 


f) 


The homogeneous quadratic form 2(a satisfies the well-known identity 

where Wu, for example, is a symbol for w,(x, w, w, u\ v')- By means of this 
formula one readily establishes the identity 

17 / (w, w)+fiir(«, v)-uJ{ijy ^)-vK{riy t) 


(23*2) 


= ^ (t7Wu' + f Wv' - , 


with the help of which the following theorem is justified: 

Theorem 23*1. For every pair of solutions ( 17 , f) and (u, v) of the accessory 
equations {23 1) the expression 

(23*3) £( 17 , f; Uy ») = i 7 «u' + — 

is a constant. When it vanishesy the two solutions (17, f) and (u, v) are said to be 
conjugate. 

The constancy of the expression (22 *3) along an extremal is an instance of 
this theorem, since (ya, Za) and (yt, 25) are two solutions of the accessory equa- 
tions, and since the derivative 6/„y da, for example, is readily identifiable with 
the expression a),'(ap, ya, Sa, yi, si). 

If («i ®)> {'^h ^i)t and {u%y are three solutions of the accessory equations 
such that the last two are conjugate and have their determinant Uiv^ — u^Vi 
different from zero on the interval afiX2, the two-parameter family of extremals of 
the second variation 

i7«ii + aiii+6»2= 1 ? (», a, h) y 

(23-4) 

f « v + a»i + Ji>i«f {», a, b) 
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simply covers the region of xijf-space bounded by the planes x ^ xi and x 
and forms a field there. The expression 


dri d d dri d 

laJh^'^da db da 


db da 


xt 


analogous to the expression (22*3), is in fact L{ui, Vi; V2); and it vanishes 
because the solutions (ui, vi) and («2, ^2) are conjugate. The slope functions 
Vf f)> Vi f) of tho field in a»?f-space are found by solving equations 
(23*4) for a and b and substituting in the derivatives 

Vx(x, a, 6) + , 

txix, a, b) ~ v' + av[ + bv'^. 


If the determinant U 1 V 2 — u^vi is denoted by A, the slope functions have the 
values 


(23-5) 





u' 


u' 

u' 

jr(*. 

v^ 

u — 

V 

Ui 

U2 




» — 

r 

Vi 

»2 






K 


K (x, 

Vi 

M — 

n 

Ui 

U2 




ti — 

f 

Vi 

V2 


A new form for the second variation can now be easily deduced with the 
help of the formula (19*1). Since («, v) is an extremal of the family (23*4), it 
follows from (19*1) that for an arbitrary admissible arc in a??;f-space joining the 
end-points [xi^ «(aci), v(aPi)] and [0^2, u(x 2 )j w(aP2)] of this extremal we have 


?) -/*(«, v) 'Et,(x, 71, f, r, K, ij', t')dx, 


where the £-function, calculated for the quadratic form 2<a from the formula 
(9-2),has the value 

^ = (1?' — ir) + 2 (ij' — ir) (f ' — k) + (f ' — «) • 

This justifies the following theorem: 

Theorem 23 *2. If (u, v), (ui, vi), and (u 2 , V 2 ) are solutions of the accessory 
equations (23' 1) such that the last two are conjugate and have their determinant 
U1V2 — U2V1 different from zero on the interval X1X2, then for every pair of admissible 
variations 17, f with end-values at xi and X2 the same as those of (11, v) the second 
variation has the value 

(23-6) •'•t 


+ 2W- ») (f ' - k) U,' + (f' - k) V.'.'] dx 
in which v and k are the slope functions defined by equations (23 - 5). 
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In the literature on the second variation the differences — w, k are 
usually given in the forms. 




^ A 


--/ 

17 — u 


u* 

^2 

ri~“U 

Ui 

U2 

t-v 

Vi 

V2 

r- w' 

K 

< 

ri — u 

Ui 

U2 

f - V 

Vi 

V2 


which are readily deducible from equations (23 -5). 

If the non-singular extremal arc E12 for the original integral I has on it no point 
conjugate to i, there is always a pair of conjugate solutions (ui, Vi) and (m 2, ^2) 
with determinant different from zero on the interval XiX2^ For, according to 
Theorem 12 - 2 , there is then a value Xq less than xi which has no conjugate 
value on XiX2- The solutions (ui, Vi) and («2, V2) of the accessory equations, whose 
elements vanish at Xq and whose derivatives at xo form the identity matrix, 
are a conjugate pair, as one readily verifies. The determinant A = U1V2 — 
has a factor {x — iCo)® only, and hence does not vanish identically, as one sees 
by applying Taylor’s formula a,t x — xo to the elements of A separately. 
Theorem 12*3 then shows that A does not vanish on the interval X\X2 since 
the zeros of A are the values conjugate to xq and since there are, by hypothesis, 
none such on that interval. The truth of the following statement is thus 
established: 

Theorem 23-3. For a non-singular extremal arc E12 having on it no point 
conjugate to 1 there exists a pair of conjugate solutions (ui, vi) and (u2, V2) of 
the accessory equations with determinant different from zero on the interval X1X2. 
For every other solution (u, v) of the accessory equations the second variation 
^2(^9 f) along Ei2 is expressible in the form {23 6) for all admissible variations 
17, f having the end-values of u, v at xi and X2. If E12 satisfies Legendre^ s condition 
in the stronger form I 1 I\ then 12(^7, f) is always greater than l2(u, v) for such varia- 
tions unless 17 — usf — vsO(?» X1X2. 

An important special case is the one for which («, v) is the particular solution 
( 0 , 0 ) of the accessory equations. It follows in that case that under the hypotheses 
of the theorem the second variation is non-negative for every pair of admissible 
variations (17, f) vanishing at X\ and X2, and that for such variations the second 
variation vanishes only when (17, f) s (0, 0). 

Corollary 23 - 1 . If E12 satisfies condition IIV and has conjugate end-points 
but no point conjugate to 1 between 1 and 2, then for every accessory extremal 
(u, v) the value l2(u, v) is at least an improper minimum relative to the values of 
I2 on admissible arcs in n^-space joining its ends. 
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The corollary is an easy consequence of the theorem, as can be seen with the 
help of suitable approximation functions. An arbitrary admissible set of varia- 
tions (t/, 1*) is approximated both in position and direction on the interval 
— e, asc>0 approaches zero, by the set 

ff(x) =v(x) + (x- X,) “ ~ , 

X2 e ““ Xs 

X2 ““ e Xg 


The set of variations identical with rjj f on the interval xi ^ x ^ xg, with ff, Z 
on xg ^ X ^ xg — €, and with u, v on xg — € ^ x ^ xg, gives the second varia- 
tion a value greater than IgCu, v), by Theorem 23 -3, since there is no value 
conjugate to Xi on the interval rri < a; ^ iC 2 — e. By letting e approach zero 
we see that IgCv, f) ^ Mu, v). 

In deducing the formula (23 • 6) for the second variation, use has been made 
in this section of the notion of a field and the integral formula of Weierstrass. 
It is interesting, however, to know that the formula (23-6) can be obtained 
without these aids, though the argument is rather an artificial one. The notations 


(23-7) 


= aui + bug , 
^ — 14 ' = au[ + bu[ , 

U = a'«i + i'«2 , 
P = a'lij + , 


f— » — a vi +bvg , 
<c — v' = , 

V^a^vi + Vvg, 
Q^a'v[ + Vv[ 


will be helpful, where the notations of the first two rows are the same as those of 
preceding pages and where the last two rows define U, P, V, Q. It will be remem- 
bered that a, ft are functions of x with the derivatives a', ft'. We shall denote 
u, V, u', »') by «,(«, «'), with similar notations for other derivatives and 
sets of arguments. Then if 7i(x) and f(x) are given and a and ft are determined 
as functions of x by means of equations (23 *4), which are the same as the first 
two equations of the system (23 • 7), we have 

T-u') (aui+bug, au' + bu') 

(23-8) 

— IT — tt') +0)v'(U, P) , 

with similar equations for cuf, since (ui, Vi) and (ug, Vg) are solutions of the ac- 
cessory equations and the derivatives of ci^ involved are linear in a, ft. Further- 
more, since the solutions {u\, Vi) and (ih, Vg) of the accessory equations are con- 
jugate, the same is true of every pair of linear combinations of them, and it 
follows with the help of equations (23*7) and the expression (23*3) that 


(23*9) 2 (iy -«)«,' (17, P) ir-tt') »0 
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where the meaning of the sums is evident. This formula holds even when 
a, 5, o', V are arbitrary functions of a?, since its first member is a bilinear form 
in o, ft, o', ft' with coefficients all zero. 

With the help of Taylor’s formula, 

2o>(i?, 1?') -2w(tt, «') = 2S[ (17— «) a?„(w, «') 

+ (1;' — «') ] +2ci> (17 — — 

2a)(i? — w, — =2 (o(ij — «, IT — ti') 

+ 2 S (^' — it) cov — w, ir — w')+ 2 «( 0 , 1;' — t) . 

The first member of the first of these equations is, therefore, equal to the sum 
of 2ct)(0j — tt) and the expression 

22; [ (r; — «) «,(«, «') + (17' — w') 6),' («, w') ] 

+ 2S (17' — ir) wn' (17 — IT — w') +2a)(i7 — M, ^r — w'). 

But this last expression is the derivative of 

2 (17 “ w) [2a)V (m, «') +WV (ri — Uy T — w') ] , 

as one sees with the help of equations (11 1), (23 * 8 ), (11*3), (23 *9), and (23 * 7). 
Hence, for an arbitrary pair of admissible variations 17 , f the formula 

2 «(i 7 , 17') — 2w(«, w') ~2w(0, 17' — tt) 

(2310) j 

+ ^S(i7-«) I2wv(w, «') +wv(t7-w, 7r-«')] 

is valid, the last term 2co(0, 17 ' — ir) being the quadratic form £ 2 ^ just preced- 
ing Theorem 23 -2. An integration of this formula gives equation (23- 6). For 
the particular accessory extremal («, v) = (0, 0) equation (23 10) becomes 

(23-11) lain, v') t) +2«(0, v'-ir). 

This is a variation of a formula which has often proved useful in the theory of 
the calculus of variations.^* 

24. Sufficiency proofs without the use of fields. In the following paragraphs 
two proofs of the sufficiency of the conditions in Theorem 16 • 1 for a weak rela- 
tive minimum are first given, neither of which depends upon the notion of a 
field. The first proof is modeled after that of Weierstrass for parametric prob- 
lems in the plane, to which reference was made in Section 14. The second de- 
pends upon continuity properties of the solutions of differential equations con- 

“ Von Escherich, Wiener Berichte^ CVIII (1899), 1283, eq. (9) ; Bolza, op, ciL, p. 630, eq. (68); 
Hahn, Rendiconti dd circdo matematico di Palermo^ XXIX (1910), 64, eq. (48); Bliss, Bulletin 
of ike American Mathematical Society, XXVI (1920), 359, and Proceedings of the International 
Mathematical Congress, Toronto, 1924, 1, 589. 
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taining arbitrary functions established by Bliss^^ and is simpler than the first 
proof, provided that these properties are presupposed as known. Both proofs 
make use of the transformation of the second variation in Theorem 23*2 of 
the preceding section. As was seen in the concluding paragraphs of the last 
section, this transformation can be justified without the use of the notion of a 
field, and the sufficiency proofs here given can therefore also be regarded as 
independent of that notion. 

In the concluding paragraphs of this section a sketch is given of a similar 
proof, not depending upon the notion of a field, for the sufficiency of the con- 
ditions in Theorem 16*2 for a strong relative minimum. The details of this 
somewhat complicated proof may be found in the papers of Levi, Malnate, 
and Reid, to which reference was made in Section 23. 

Let En be an admissible arc 

y{x), z (x) (xi^x^ X2) 

without corners and satisfying the conditions I, III', IV', as presupposed in 
Theorem 16 * 1; and let 

(241) Y (x) ^ y (x) +ri (x) , Z (x) - z (x) + ^ (x) (xi^x^X2) 

be an admissible arc Cn joining the ends of £12 and lying in a neighborhood Ri 
of the values (x, y, z, y', z') belonging to £12. It is desired to show that when 
Ri is sufficiently small the difference 


(24-2) [/(*) y + Vt 2 + f» y' + n'r + 

-/(*, y, /, z*)\dx 

will be positive or zero and will vanish only when ij s f = 0 on 

The difference (24-2) can be expressed by means of Taylor’s formula with 
integral remainder in the form 


(24-3) 


A/ = fJ'Uyn + fy' v' + U + dx 

‘w(x, n, f. n', + jT ‘o(*, v, t, v', S')dx. 


where Q is a homogeneous quadratic form in 17, f , 17', f ' having the coefficient 

I/w(*, y + ev, z + H, y' + 0v'. z' + en 

y» y, z')]d9 

“Differential Equations Containing Arbitrary Functions,” Transactions of the American 
Mathematical Society, XXI (1920), 79. 
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for 17*, and other similar coefficients. The first integral in the expression (24*3) 
vanishes, as one readily sees with the help of the Euler equations of Corollary 
6 • 1, which are consequences of the property I of £12. Let 

be a quadratic form with distinct positive constant coefficients ki (i « 1, 
. . . , 4). Then A/ is expressible in the form 

(24*4) A/ = f (cj — <p)dx-i- f dx , 

•'*1 •'*1 

When the constants ki are fixed with sufficiently small values, the first integral 
in the formula (24*4) for A/ is positive unless ?y = f s 0 on Ji;i:r2. This can be 
seen with the help of the results of Section 23. The conditions III', IV' on £12 
and Theorem 23 • 3, in fact, imply that the accessory equations (23 • 1) for o) 
have a conjugate system of solutions w,*, Vi {i = 1, 2) with determinant 
U\V 2 — u^Vi different from zero on From well-known theorems concerning 
differential equations containing parameters the solutions of the accessory 
equations 

^ (wv — — («i| — kiTi) = 0 , 

for w — <^ with the initial values of «i, Vi, and «2, ^2, covi a.tx = xi 

will be continuous in the parameters ki, ^2, ^s, k^; and it can be seen therefore 
that they will also be a conjugate system, with determinant different from zero 
on XiX 2 if the constants ki are sufficiently small. Furthermore, the quadratic 
form with the coefficients 

fyy* ^3 fy*M* 

fy' 9* ^ 9* 9* ^4 

is surely positive definite for small values ki, since this same quadratic form 
with ifea = jfe4 = 0 is positive definite on account of the property III'. The trans- 
formation of Theorem 23 *2 therefore shows that the first integral in equation 
(24*4) is positive, as stated above. The quadratic form + 0 is positive defi- 
nite in the variables 1;, f, 17', f' when Ri is taken sufficiently small, since its 
coefficients will then differ very little from those of 0. This completes the proof 
of Theorem 16-1. 

For the second proof of Theorem 16*1 we may denote by 12(^, «, v, «', »') 
the homogeneous quadratic form in «, », 14', w' with the coefficient for w* equal to 

f\i-0)fyy(x, y + OVf z + et, y' + ^V, z' + sndS 
^0 
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and other similar coefficients. The difference A/ is then expressible by Taylor’s 
formula in the form 

A/ = f\(x, rt, f, V, ndx, 

since the first variation vanishes, as before. The conditions III^ IV' and 
Theorem 23-3 imply again the existence of a conjugate pair of solutions (m,*, 

(» 1, 2) of the accessory equations for 6) with a determinant uiv^ — « 2 »i 

different from zero on xix%. The pair of solutions of the accessory equations for 
M, », u\ »') with the initial values of («»*, ««;) (i = 1, 2) at ir = x\ is 

a conjugate pair and also has its determinant different from zero on xiXi when the 
neighborhood R\ is sufficiently small, since these solutions are continuous func- 
tionals in iy(ap), f(jc), r{{x), f'(af) and can be made to differ as little as is desired 
from (m<, (i = 1, 2) by restricting Ri, according to the theorems of Bliss re- 
ferred to above. Furthermore, the quadratic form 0, 0, w') is positive 

definite when R\ is small, since the same is true of the corresponding form <a{xj 
0, 0, »'). Theorem 23 • 2 then assures the inequality A/ > 0 for all admissible 

arcs (24* 1) in the nieghborhood Ri joining the ends of £12 and not vanishing 
identically. 

In order to make a proof by expansion methods of the sufficient conditions 
of Theorem 16*2 for a strong relative minimum, we may use the notations 
(24* 1) for the arc C12 and express the integrand of A/ in (24*2) in the form 

E{x, F, Z, p, ?, F', Z') 

(24*5) + + 

+ «(«, Vf f, v'f f') Y'-p, Z'-q) 

+B+{Y'-p)C+{Z'-q)D. 

In this expression the arguments of the derivatives of /, where not otherwise 
specified, are the functions x, y{x)y z{x)y y\x)y z\x) belonging to the arc £12 
whose minimizing properties are to be established, and 

26 » fvw (<r-py+ 2/v'.' {r-p) (z' - ?) + /// (z' -qy. 

The arguments py q will be specified presently. The symbols £, C, D have the 
values defined by the equations 

/(*, F, Z, p, q) -/(*, y, z, /, «') *■/.»» + /,f + 

+ /.'(?- z') +«(*, V, f, P-y', g-zO+B 

/»'(*. Yt Z, p, q) - + + /»V (^-yO 

+/.v(g-z0 +C, 

(*, Y, Z ^ , g) - + /.',u + /.'.f + /.'/ ip - y) 

+/,v(g-z') +D. 
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It is evident that they are also expressible as integral forms of remainders in 
three applications of Taylor's formula. 

The integral of the terms in the second row of the expression (24 • 5) vanishes 
by the usual integration by parts. Since the extremal arc £12 satisfies the condi- 
tions III' and IV', it follows, as for Theorem 23 • 3, that there exists a conjugate 
pair of solutions (wi, »i) and (« 2 , %) of the accessory equations along £12 with de- 
terminant different from zero on the interval X 1 X 2 . The integral of the third row 
in the expression (24 S) vanishes as a result of the transformation (23 *6) ap- 
plied to the integral of w, provided that we determine a, b, ir, k, q by the 


equations 



(24-6) 

H =» aui + but , 

f = a»i + 6»2, 

(24-7) 

ir=au' + bu' 

K = 0®'j+ , 

(24-8) 

P = y' + r , 

q= z' + K . 


The first two of these equations are equivalent to (23 *4) when « s i> s 0; and 
T and K are the corresponding slope functions (23 • 5) . Since Y— y — rj,Z — z — 
the variables p and q may be regarded as functions of x, F, Z, the slope functions 
of the two-parameter family of curves defined in a:FZ-space by the equations 

Y — y{x) + aui + 6^2 , Z = z (x) + a vi + 6 V2 . 

With the help of arguments like those used in the paper of Reid, to which 
reference was made in Section 23,^® it can be shown that, as a consequence of the 
properties Un and III', presumed for the arc £ 12 , there exists a neighborhood F 
of £12 in acFZ-space so small that along every admissible arc C 12 in F 

(24-9) E(x, F, Z, py q, F' Z') ^ r£ [mod (a', i') ] , 
where r is a positive constant and 

mod (a', b’) - (o'» + 6'*) , RU) = . 

In securing this result one must have regard for the relations 

7' — ^ — 2r = a'ui + b'u2 , Z' — g =* f ' — k = a' + 6' t>2 . 

The form B contains a, 6 in a polynomial which is homogeneous and of the third 
degree, as one sees from the equations (24 *6)-(24 -8); and the last two terms in 
the fourth line of the expression (24*5) contain a, 6, a', V in polynomials which 
are homogeneous and of the second degree in a, b and linear and homogeneous 
in fl', 6'. Consequently, the region F may be still further restricted, if necessary, 
so that along every admissible arc Cu in F 

|£| ^€[mod(fl, 6) ]* , 

I (F' — p)C+ (Z' — g) Dj g€mod(a, 6) mod (a', 6'). 

See also Tonelli, Fondatnenii di calcolo ddle variazioni^ I, 351, and the reference to Levi 
there given. 
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But Reid has also shown'* that if mod (a, i) ^ 1 
f mod (o, b) mod (a', i') dx^ki f 'r 

f’\mod (a, b) ] ^ * 2 /*’^ 


[mod (a', 6') ] do; , 
[mod (a', 6') ] dx , 


where and k 2 are positive constants. Hence for admissible arcs C 12 in a 
sufficiently restricted neighborhood F of the extremal £12 we have 

A/^ (r-€*i-€* 2 ) /***-R[mod(a', 6')ld:»^0, 


and the last equality sign holds only if a' s J' s 0, in which case a = b ^ 0 
and C 12 coincides with £ 12 . 

The account just given of Reid’s justification of the conditions for a strong 
relative minimum by expansion methods is incomplete in details. In particular, 
the proofs of the inequality (24*9) and of the last three integral inequalities 
above, require more attention. For problems with no side conditions, such as 
the three-dimensional one here discussed, Reid’s published proof of the in- 
equalities leading to (24 *9) can be greatly simplified, as he himself has indi- 
cated. 

M See section 5 of his paper last cited above. Note that a{xi) = 6(«i) « 0. Reid uses the 
function R(t) = (14- — 1, but has also suggested the one in the text here. 



CHAPTER III 

FIELDS AND THE HAMILTON-JACOBI THEORY 

25. Introduction. In Section 7 of Chapter I above, it has been shown that 
for the problem of the calculus of variations in xyz-sp&ce the solutions of Euler’s 
differential equations form a four-parameter family of curves. When a particu- 
lar problem is to be solved, the integration of these equations and the determi- 
n;ation of the extremals through the two given points 1 and 2 are of paramount 
importance and are frequently very difficult to carry out. For the development 
of the theory, however, it is sufficient, for the most part, to know the function- 
theoretic characteristics of the family of extremals neighboring a given one. 
These' are described in the so-called “imbedding theorems” of Section 7, which 
are to be re-proved by a different method in Section 27 below. 

The development of the theory of extremals has been greatly aided by the 
introduction of so-called “canonical variables” (x, y, z, «, v) in place of the origi- 
nal variables (jc, y, z, y', z') of the problem. We shall see that in terms of the 
canonical variables the differential equations of the extremals take a remarkably 
simple canonical form and that with the help of these new variables it is possible 
]to show that the extremals themselves are the characteristic curves of a partial- 
l differential equation of the first order. These theories, due to Hamilton and 
Jacobi, are explained in this chapter. They not only have been of assistance in the 
development of the theory of the calculus of variations but also have played a 
fundamental role in the applications of the theory to classical mechanics and 
to the recently developed quantum theories. A simple example of these applica- 
tions in dynamics is given in Section 30. 

Caratheodory has devised a quite new approach to a considerable portion of 
the theory of the calculus of variations. In his theory the extremals appear as 
so-called “curves of quickest descent.” A brief introduction to his point of view 
is given in Section 31. 

26. Canonical variables and canonical equations for extremals. When the 
variables x, y, z, y\ z' are replaced by a new set x, y, z, w, v for which u and v 
are defined by the equations 

(261) y, z, y', z') , » = /,'(«, y, z, y', , 

the differential equations of the extremals can be put into a remarkably simple 
form. The new variables x, y, z, u, v are called canonical variables^ and the equa- 
tions of the extremals in terms of them are called canonical eqmtions. 

In order to carry through the discussion, it will be assumed that the region R 
of Section 3 consists only of interior points {x, y, z, y', z') at which the determi- 
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nant — (/»'«')* is different from zero, and that the equations (26*1) de- 

fine a one-to-one correspondence between the points (a?, y, z, y\ z') of R and the 
points (Xf y, z, v) of the region S into which R is transformed by means of 
these equations. Equations (26*1) have then single-valued solutions 

(26-2) y'^P(x, y, z, u, v) , = y, z, u v) , 

which also relate corresponding points of R and S. 

A set (x, y, z, u, v) will be called admissible if it lies in the region 5, and an 
arc defined by functions 

y («) , z (*) , u{x) , V ix) ixi^x^ X 2 ) 

will be called an admissible arc if it is the image of an admissible arc in R, 
The arguments in this chapter are based upon the assumption that the inte- 
grand function / has continuous derivatives in the region R at least up to and 
including those of order m = 3, in accordance with remarks in footnotes 5 and 11 
on pages 7 and 17. According to well-known theorems concerning implicit func- 
tions,^ the functions P and Q in equations (26 -2) then have continuous partial 
derivatives of at least the second order, since, when the integrand function /has 
continuous third derivatives, the second members of equations (26*1) have 
continuous second derivatives. Furthermore, every point {xo, yo, «o, «o, vo) of S 
is an interior point, since for every point (x, y, z, u, v) sufficiently near to 
(xo, yo, Zo, uo, Vo) equations (26*1) have a solution y', z' defining a set (x, y, z, 
y', z') interior to JR. 

A function B(x, y, z, u, v) can be defined by means of the equations 

y> “ [//«' +*7/ 

(26-3) 

y, z, P, Q). 

The two forms for H are readily identifiable by means of the identities 

«-/,'(*, y, z, P, Q), »«/,'(», y, z, P, Q), 

with the help of which one can also deduce readily the following partial deriva- 
tives for H: 

Hu^P(x, y, z, u, v), !?,--/,(*, y, z, P, Q) , 

(26-4) y, z, u, v) , y, z, P, Q) , 

"-/.(*> y, z, P, Q). 

From these formulas it follows that B has continuous derivatives of order at 
least three when the function / has that property. 


^ See the Appendix below. 


31855 
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M^eorem 26- 1. Every extremal arc 
(26-5) • y{x), z{x) (xiS*^xO 

defines hy means of equations (26 • I) an admissible continuous solution 
(26-6) y(x) z(x), u(x), v(x) (xi^x^x^) 

of the equations 


(26-7) 


^ II 


dz 

dx 




du 

dx 




Conversely^ for every admissible continuous solution of these equations the functions 
y(x), z(x) belong to an extremal. 

Equations (26 *7) are called the canonical equations of the extremals.* 

The first part of the theorem is easily proved; for the functions u(x)j v(x)^ 
corresponding to the extremal arc (26-5) by means of equations (26- 1), neces- 
sarily satisfy equations (26 -2), so that 


dy 

dx 


= P 


Hu, 


dz 

dx 


= Q = H 


V • 


Furthermore, equations (26 - 1), (26 *4), and the Euler equations of Corollary 
6* 1 show that 


— / Tf 


dv 

dx 




Conversely, if an admissible arc (26*6) satisfies the first two equations (26-7), 
it must satisfy equations (26 - 1), and the last two equations (26-7) then imply 
Euler’s equations. 

The preceding results can be applied to the accessory minimum problem of 
the second variation. The equations defining canonical accessory variables 

Vf if <Pf ^f analogous to equations (26 - 1) for the variables x, y, z, u, v, are 

(26*8) ^ = cov (^j f , 17 ^, f ^ “ wf' (^> if V f f)> 

where Icj is the quadratic form in the integrand of the second variation, as de- 
scribed in Section 4 above. The equations (26-8) are linear in the variables 
Vf if v'f i' and have as their matrix of coefficients of rj', f' the matrix (6*3). 
Hence, for a non-singular extremal arc they have solutions 

(26-9) v'^n(x, V, i, 1 ^), 17 , if H>f 4')f 

* The introduction of canonical variables and equations into the problems of the calculus 
of variations associated with theories of mechanics is ascribed by Jacobi to Hamilton. See 
Jacobi, Vorlesungen iiber Dynamik, p. 143. These lectures of Jacobi were delivered at the 
University of Kdnigsberg during the year 1842-43. See also Hamilton, “Second Essay on a 
General Method in Dynamics,” Philosophical Transactions of the Royal Society of London, 
1235, p. 98. Lagrange had used differential equations of the canonical form in his theory of 
perturbations; see his Micanique analytique (1811), p. 336. 
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which are linear and homogeneous in i;, f , <p, The Hamiltonian function for 
the accessory minimum problem is defined by the equation 


(2610) 


ri, f, ipt = [i?W + fW 

-Tl<p+K\l^ — ct)ix, rif n, K). 


It is homogeneous and quadratic in the variables 17 , J*, ^ and has derivatives 

~ n , = K , 

- Wx , Qn = — Wn , = — Wf , 


analogous to (26 *4), in which the arguments of the derivatives of <a are x, 17 , f, 
n, K. The canonical accessory equations have the form 


(2611) 





# 

dx 


-Qr. 


Every accessory extremal rj(x), i{x) defines by means of the equations (26-8) a 
continuous solution of equations (26 - 11 ), and conversely. 

Equations (26 * 11) have second members linear and homogeneous in 77 , 
rp. There is one and but one solution of these equations taking given initial 
values 770 , fo, ^ 0 , ^0 at a value x xq. In particular, if these initial values are 
all zero, the unique corresponding solution is 77 = f s ^ s ^ s 0. If ijv(x)y 
^p(x)j <pp{x), 4^p(x) {v ^ 1, . . . , 4) are four linearly independent solutions of 
equations (26 * 11 ), then every other solution 77 ( 3 ;), f(ir), ^(a?), \l/{x) is expressible 
linearly with constant coefficients in terms of them. The determinant 



ritix) 

Vs(x) 

riiix) 

ri(*) 

h (*) 

f» (*) 

iiix) 

(Plix) 

(Pt(x) 

^»(*) 

Viix) 

^l(*) 

4 'Ax) 

i'iix) 

hix) 


of four solutions is either identically zero or everywhere different from zero, and 
a necessary and sufficient condition that the four solutions be linearly independ- 
ent is that it be different from zero. These are results which are well known 
from the theory of systems of linear differential equations.* 

27. A second proof of the imbedding theorem. By means of the canonical 
equations the imbedding Theorem 7 * 1 for a non-singular extremal arc £12 can 
be proved under the hypothesis that the integrand function / has continuous 
partial derivatives of the third order, instead of the fourth order as originally 
presupposed in Section 3. To justify this statement it can be shown, in the first 
place, that the elements (x^ y, z, y\ z') on a non-singular extremal arc £12 are 

> See Sec. 12, above; or, e.g., Goursat-Hedrick, A Course in Analysis, II, Part II, 152 ff.; 
Kamke, Differentialgleichungen reder Funkiionen, chap. v. 
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always contained in a region R which is transformed in a one-to-one way into 
a region S of points (rr, j, z, v) by means of the equations 

(27-1) u = fg>{x, y, z, y', z') , » = /.'(*, y, z, y’, z') 

in the manner described in Section 26. This is an immediate consequence of a 
theorem of Bolza^ concerning implicit functions. The solutions 

(27-2) y, z, v), z' = (>(:r, y, z, ») 


of equations (27*1), thus defined in R, will have continuous derivatives of the 
second order at least, since the second members of equations (27 * 1) have this 
property; and it is easy to see that the same will be true of the second members 
of the canonical equations 




dv 

dx 




The existence theorems for differential equations now justify the statement 
that through each initial element (x, y, z, «, v) = (iCo, fl, b, c, d) in a sufficiently 
small neighborhood of the elements belonging to £12 there passes one and but 
one solution of the canonical equations (27*3), Xo being a fixed value on the 
interval X 1 X 2 . The functions defining these solutions have the form 


y(x, a, bf c, d) , z(Xf a, b, c, d) , 

(27*4) 

u(x, a, b, c, d) , v(x, o, b, c, d) ; 
and the family contains £12 for 

Xi ^ X ^ X2 t a = do f b = bo f c = Co , d = do f 


where (xo, do, bo, cq, do) is the element (x, y, z, u, v) belonging to £12 at a; = xq. 
The functions (27 *4) and their derivatives y*, Zx, «x, have continuous partial 
derivatives of at least the second order in a neighborhood of the values (x, d, 
b, Cy d) belonging to £ 12 , since the second members of equations (27 *3) have this 
property when / has continuous third derivatives, as was presupposed at the 
beginning of this section. The first two functions (27 -4) define a four-parameter 
family of extremals, as was seen in Theorem 26 - 1. 

The determinant (7*6) for this family is different from zero dXx = Xo, since 
the product of the determinants 


(27*5) 


1 

0 

0 

0 


y« 

yt 

yc 

yd 

O' 

1 

0 

0 


Za 


Zg 

2d 

fv'v 

/»'. 

/»’»' 

V.' 


yaz 

yts 

ycx 

ydx 


w 

/«v 



Zax 

Zbx 

Hex 

Zdx 


^ Vorlesungen dber Variationsrechnungy p. 160, or Mathematische Annalen, LXIII (1906), 
246. See also Mason and Bliss, “Fields of Extremals in Space,” Transactions of the American 
Mathematical Society, XI (1910), 326, or Bliss, Princeton Colloquium Lectures, p. 20 and Sec. 8. 
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is the Jacobian d{y, z, u, v)/d(a, b, c, d), which has the value 1 at « » as 
one readily verifies from the identities expressing the fact that the curve (27-4) 
passes through the initial element (xo> b, c, d). It follows, as in Section 12, 
that the determinant (7-6) is everywhere different from zero on the curves of 
the family (27-4). 

It can be proved directly, by a method similar to that in Section 12, that the 
Jacobian d(y, z, u, v)/d(a, b, c, d) is everywhere different from zero, since by 
substituting the functions (27-4) in the equations (27-3) and differentiating 
with respect to the constants a, b, c, d we find that the columns of the Jacobian 
are solutions («;, i*, 4d of the linear equations 


(27-6) 


S — HfD/fl Byu<P Byjp , 


dx 


- Htufp - . 


By an argument quite similar to that for d(x) in Section 12 or by known theo- 
rems concerning linear differential equations,*^ it can now be shown that the 
determinant of four solutions of equations (27 *6) is different from zero every- 
where if it is different from zero at a single point. 

28. Transversal surfaces of a field and the Hamilton- Jacobi equation. A 

surface W{Xf y, z) = constant is said to be transversal to the direction 1 : y : z' 
at a point (x, y, z) if the equation 

(28*1) (/ - - «7.') dx + fy^dy + //d z = 0 

is satisfied by every set of differentials dx : dy : dz which satisfies the equation 
28-2) W^dx +Wydy +W,dz=- 0 . 

This property of transversality is equivalent to saying that every curve on the 
surface through the point {x, y, z) is transversal at that point to the direction 
I : y : according to the definition of transversality in Section 10. 

In a field F the Hilbert integral 

(28-3) /*-/ lif-pf,'-qf,')dx+f,'dy + f.'dz], 

formed with the slope functions of the field, is independent of the path in F. 
Taken from a fixed point {xo, yo, zo) in the field to a variable pomt {x, y, z), its 

* Gounat-Hedridc, op. cit., p. 154, eq. (125). See also the footnote on p. 68, above. 
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values define a single-valued function W(x y, z) whose derivatives are readily 
found, by the method i^sed in proving Theorem 21 • 1, to be 

(28*4) 

From the definition in the preceding paragraph it is evident that at every point 
(x, y, z) of the field the surface IV = constant through (x, y, z) cuts transversally 
the extremal of the field through that point. The surfaces W = constant are 
called the transversal surfaces of the field. 

Under the circumstances described in Section 26 the solutions p, q of the 
last two equations (28 -4) are the functions 

(28 5) p^Pix, y, z, Wy, W,), q^Qix, y, z, Wy, W.), 

where P and Q are the solutions (26*2) of equations (26 - 1). If these are substi- 
tuted in the first equation (28 *4), it follows from the definition of the function 
H in equation (26 *3) that the function W satisfies the equation 

(28-6) W^ + H{x, y, z, Wy, W,) =0. 

This is the so-called Hamilton- Jacobi partial differential equation,^ 

Theorem 28- 1. /» &oery field F the Hilbert invariant integral I* defines a func- 
tion W(x, y, z) having continuous second partial derivatives in F, satisfying the 
Hamilton- Jacobi partial differential equation {28'6)y and such that the surfaces 
of the one-parameter family W(x, y, z) = constant are transversal surfaces of 
the field. Conversely, let W(x, y, z) be a ftmction which has continuous second par- 
tial derivatives and which defines only admissible elements (x, y, z, u, v) = 
(x, y, z, Wy, W.) in a region F of xyz-space. Then, if W satisfies the Ilamilton- 
Jacobi equation {286), th^region F is a field with the slope functions {28-5), and 
the surfaces W == constant are the transversal surfaces of the field. 

It is evident that the function Wix, y, z) defined by the Hilbert integral has 
continuous derivatives of order two at least, as stated in the theorem, since in 
the equations (28 -4) the slope functions p and q of the field have by hypothesis 
continuous first derivatives at least. 

The last part of the theorem is easily provable, since every solution W of 
equation (28 • 6) with the properties described in the theorem defines slope func- 
tions p and q by means of equations (28 -5), which with W satisfy the equa- 
tions (28*4). But if these equations are true, the integral /* formed with the 
slope functions (28-5) is the integral of dW and hence is independent of the 
path. Thus, the region F is a field with the slope functions p and q, according 
to the definition of a field in Section 18. 

* This equation was apparently first introduced by Hamilton for problems of mechanics in 
1835 (fip, cit,, p. 100). Jacobi exhibited the relationship between its solutions and those of the 
canonical equations {op, cit,, pp. 157 ff.}. The corresponding partial differential equation for 
more general problems of the calculus of variations was suggested by Beltrami, ^*Sulla teoria 
delle linee geodetische,” Rendiconti del Reak Istituto Lombardo, Ser. 2, 1 (1868), 708-18, or in 
his Opere matematiche, 1, 368. 
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Theorems 20 1 and 22-2 give criteria for a two-parameter family of ex- 
tremals to be a Mayer family. With the help of the canonical extremals (27*4) 
such a family can be constructed as described in the following useful theorem: 

Theorem 28 *2. If W(a, b) is a function having continuous second derivatives 
in a region A of points (a, b) and defining only admissible elements (x, y, z, u, v) = 
(xo, a, b, Wa, Wb) for points (a, b) in A, then the two~parameter family of extremals 
defined by the functions 

y{x, a, 6, Way Wh) y Z{X, a, 6, Way Wh)y 

found by substituting c = Wa, d = Wb in the functions {27 4), is a Mayer family^ 
according to the definition of Section 22, 

This follows because a^b^c ^ Way d^Wh are the initial values £it x = Xo 
of the four functions (27 *4); and for an arc on the plane x = xo the integral 
I* therefore reduces to the form 

/*= / (fy'dy + J.'dz) = f (Wada+Wi^db), 

which is independent of the path. Theorem 20 *1 now justifies the last theorem. 

The transversal surfaces and extremals of a field have a variety of properties 
which are deducible from the two fundamental ones in the following theorem: 

Theorem 28*3. On an extremal arc Eia of a field the value of the invariant 
integral I* is equal to the value of the original integral I. On an arc C 12 lying in a 
transversal surface of a field the value of the integral I* is zero. On all arcs C 12 in a 
field F with end-points on the same two transversal surfaces the values of the integral 
I* are the same. 

The first of these properties follows from the fact that the differential equa- 
tions 

= y, z), y, z) 

are satisfied by every extremal of the field, and it is readily seen that on every 
solution of these equations the integrand of the integral (28 * 3) reduces simply 
to f{Xy y, z, pj q). To prove the second property we need only to note that the 
differentials dx : dy \ dz of an arc on a transversal surface W = constant 
satisfy equation (28*2) and hence also equation (28*1), as a consequence of 
equations (28*4). But the first member of (28*1) is the integrand of the in- 
variant integral (28 *3). The third property of the theorem is a consequence of 
the equations 

/♦(Ci 2 ) - f dW^W (X2y y 2 y Z2) ^ W (xi, yiy Zi), 

It further follows from this, with the help of the first statement of the theorem, 
that if there is an extremal arc £34 of the field joining the two surfaces, then 
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29 . Extremals as characteristics of a partial differential equation. The re- 
sults described in the preceding section justify the introduction of the partial 
differential equation (28-6) at this stage of the theory. It is evident from 
Theorem 28*1 that a solution W(Xy y, s, a, b) of equation (28 6), containing 
two arbitrary constants a and 6, determines a two-parameter family of fields 
with their corresponding two-parameter Mayer families of extremals; and the 
totality of these extremals should therefore form the complete four-parameter 
family of extremals of the problem. The proof of this result in the following 
paragraphs is independent of the developments in Section 28. 

The function W(x, y, z, a, b) in the following theorem is supposed to have 
continuous partial derivatives of the second order and to have its determinant 
WayWht — WatWhy different from zero for all points (x, y, z, a, b) satisfying 
conditions of the form 

(29*1) (x, y, z) in a region F , (a, &) in a region A . 

Furthermore, the sets (x, y, z, u, v) = (x, y, z, Wy^ Wt) corresponding to these 
arguments are all supposed to be admissible. 

Theorem 29 1.^ If a function W(x, y, z, a, b) has the properties described 
above and satisfies the partial differential equation 

(29-2) W, + n{x, y, z, Wy, W,) =0 

in a region {29 1)^ then every admissible arc E 12 interior to F, having no corners 
and satisfying the equations 

(29-3) Wa^c, W^-=^d 

for particular constant values ao, bo, Co, do, is an extremal. Furthermore^ equations 
{29 ‘3) have a family of solutions 

(29-4) y{x, a, i, c, d) , z{xy a, i, c, d) 

containing E 12 for values (x, a, b, c, d) satisfying conditions of the form 
(29*5) Xi ^ X ^ X 2 f “ Oq , & ~ bo , c ~ cq , d ~ do 

and forming a four-parameter family of extremals with properties similar to those 
described in Theorem 7 1. 

To prove the first part of the theorem, it should first be noted that the 
identities 

+ ff y + ^u^yv + = 0 , 

(29*6) 

W^ + HJVyt + HJVa^O , W,, + H, + HJVy, + H3V„--0 

are consequences of the fact that W satisfies the Hamilton- Jacobi equation 
(29 *2). They are found by differentiation of that equation with respect to u, 6, 
y, z. If the functions 

y(x), z(x) {xi^x^xt) 


’See Jacobi, op. cU., pp. 157 ff. 
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defining E 12 are substituted in Wy and Wg to form two new functions u(x), v(x), 
then it is readily seen, with the help of equations (29-3), that 

W„+Wayy'+WagZ'^ 0 , u'=-Wyg+Wyyy'+WygZ ' , 

W,,+Wtyy'+lVtgz' = 0 , v'^Wg,+Wgyy' . 

These equations and the set (29*6) show that y(x), z(x), u(x), v(x) satisfy the 
canonical equations (26 * 7) and hence that £12 is an extremal. 

The last part of the theorem is a consequence of the fact that the values (x, y, 
z, a, by Cy d) belonging to £12 form a set satisfying equations (29 • 3) on which the 
determinant WayWhn — WatWhy is different from zero. Hence, equations (29*3) 
have solutions (29*4), including £12 for values (29*5) and having continuous 
partial derivatives of at least the first order in a neighborhood of these values. 
The functions u = IF*, v = Wy, with (29*4) substituted, also have continuous 
first partial derivatives. Since from the first part of the theorem the functions 
(29*4) with u = Wyy v — Wg satisfy the canonical equations (26 *7), it follows 
that y*, Zx, Ugy Vg also have continuous first partial derivatives. These con- 
tinuity properties are similar to those described in Theorem 7 • 1 but not so ex- 
tensive. If the function W has continuous third partial derivatives, the func- 
tions y, Zy Uy V and their derivatives with respect to x have continuous second 
partial derivatives, exactly as stated for y, 2 , y*, Zx in Theorem 7 * 1. By substi- 
tuting the functions (29 *4) in equations (29*3) and differentiating the result- 
ing equations and u = Wyy v = Wg with respect to a, 6, c, d, it is readily seen 
that the product of the determinants 
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is the determinant 
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-Wya 
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-w„ 
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0 
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and hence that the value of the functional determinant 3(y, z, «, v)/d(a, 6, c, d) 
is everywhere different from zero. This establishes the fact that the final prop- 
erty of the family (7*5) also belongs to the family (29-4). 

A brief sketch of well-known results concerning the solutions of a partial 
differential equation 

(29-7) Fixiy . . . , Xn, pu • • • > ^n) * 0 • 
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where pi « dWIdXi (i* 1, . . . , will indicate how the preceding results fit 
into the general theory of the characteristic strips of such an equation.^ Let 
W{xi^ . . . ^ Xn^ a\y , , , , fln-i) be a solution of equation (29 *7) containing n — 1 
parameters a, (r = 1, 1). It is evident that for every value c 

IV fV (Xlj m , m f Xf^f dlf • . • y C 

will also be a solution. If C(ai, . . . , a»-i) is an arbitrarily selected function 
of aiy . . . y an-i the (» — l)-parameter family of solutions 

(29-8) W (xu . . . , Xnt du ...» On-i) -C(ai, . . . , 

will have an envelope determined by solving the equations 

(29-9) 0 (r=l, n«l) 

for di, ... j fln-i as functions of JCi, . . . , Xn and substituting in equation (29*8). 
It is easy to see that this envelope is also a solution of equation (29- 7). The 
curve in the space of (w + l)-dimensional points (jci, . . . ,XnfW) and the strip 
of normals pi : . . . : pni — I associated with this curve by setting pi = Wxi, 
has the property that along it a particular solution (29*8) touches the envelop- 
ing space.® Such a strip is called a ^'characteristic strip*’ and is determined by 
equations (29*9) and (29*8). The differentials dxty dp^ along a characteristic 
strip satisfy the equations 

Wa^^d x.^0 , dpk- WxkZid Xi , 


where it is to be understood that terms containing a repeated subscript i stand 
for the sum of these terms for t = 1, ... y n, in accordance with well-established 
notations of tensor analysis. By substituting W{x, a) in equation (29*7) and 
differentiating with respect to a,, Xk, it is found that 


These equations and the preceding ones show that the characteristic strips all 
satisfy the differential equations 


dxi 


dXn dPi ^ _ dPn 


Since the function C(ai, . . . , a»-i) may be selected arbitrarily, all strips de- 
termined by equations of the form 


are characteristic strips. The results stated in Theorems 26- 1 and 29 1 now 
show that the extremals in the »y 2 «t^-space are characteristic strips of the 
Hamilton-Jacobi partial differential equation. 

• For this theory see, e.g., Gouisst-Hedrick, op. dt.. Secs. 85-87. 

* Goursst-Hedrick, ilM., p. 262. 
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30. An application in dynamics. The differential equations 
(301) = 

are the equations of motion for a particle of mass m moving in a field of force 
whose components are the partial derivatives of the function —U(x, y, s, /) 
with respect to oc, y, z. It is understood that the primes indicate derivatives 
with respect to the time t and that the motion is described by three functions 
y(0) ^(0 which satisfy the equations, and which with their derivatives 
take initial values specified at a time /o by the initial position and velocity of 
the particle at that time. One may now readily verify the truth of the following 
theorem: 

Theorem 30- 1. Hamilton’s Principle. The differential eqmtions (30 1) of 
the possible trajectories of the moving particle described above are also the differential 
equations of the extremals of the integral 

iT-V) dt, 

where T is the kinetic energy 
of the particle. 

The problem of the calculus of variations thus introduced is in the four- 
dimensional /jcys-space instead of the :rys-space of the preceding sections, but 
for the calculation of the Euler differential equations this necessitates only a 
change in notation from a; to ^ as independent variable and to three instead of 
two equations. 

An important application of Hamilton’s principle lies in the recognition of 
the fact that minimizing curves of the integral I will go over into minimizing 
curves after a transformation of the coordinates x, y, z into any other coordi- 
nates 9 i, q 2 j qz by means of equations of the form 

(30-2) X’^xiqi, qi, g,) , y^yiqi, gj, g»), «“*(gi, g*, gs). 

Without the necessity of further analysis it is then clear that in the new coordi- 
nates the differential equations of motion are the equations 

d (T- U) _ 
dt dqi ’ 

where it is understood that the expressions (30*2) and their derivatives 

with respect to t have been substituted in T and U. 

The canonical coordinates t, g<, Pi corresponding to t, qt, ql are defined by 
means of the equations 
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which are linear in the derivatives and therefore easily solvable. The variables 
Pi are called moments, since in terms of the original coordinates x, y, z they are 
the products 

Pi=mx', P 2 =my', Pt = mz’ 

of the mass times the projections x', y', z' of the velocity. The kinetic energy T 
is a homogeneous quadratic form in the variables q[, as in the original velocities 
x', y', z'; and Hamilton’s function H(t, q, p) is found, with the help of this fact, 
to have the value 

H{t, q, p) =T+U. 


The canonical equations of motion take the simple form 

dpi dH 
dt dPi’ dt dqi’ 

When the function U does not contain / explicitly, the same is true of 11; and 
these equations imply readily the principle of the conservation of energy 


dH 

dt 




dqi 

dt 




dPi _ 


^'dt 


= 0 , 


which says that along every trajectory the sum H of the kinetic energy T and 
the potential energy f/ is a constant. 

The partial diflFerential equation of Hamilton and Jacobi is 

(30-3) Wi + H(t, q,W,) =0. 


If a solution W(l, q, ai, <h, cn) is known, the equations of the trajectories are 


where the o,- and are arbitrary constants. The case when 17 and there- 
fore H do not contain / explicitly is again an important one. In this case, if 
W(q, ai, 02, Oz) is a solution of the equation 

Biq, W„) =as » 

then a solution of (30-3) is W{q, a) — otjt, and the equations of the extremals 
&r6 


31. Bxtremals as curves of quickest descent. Caratheodory^^ has emphasized 
geometric properties of extremals which he makes the basis of a very interesting 
approach to the theory of the calculus of variations. In order to exhibit these 
properties for the three-dimensional case studied in the preceding pages, let 

(3M) y, z) 

10 “tJber diskontinuirliche Losungen in der Variationsrechnung” (diss., G5ttingen, 1904). 
See also chap, v, on “Variationsrechnung,” by Carath€odory in Von Mises, Die Differential^ 
und Integral-glekhungen der Mechanik und Physik, and Cara^^odory, Variatiansrechnung und 
partieUe Differentialglekhungen erster Ordnung (1935), pp. 24^51. 
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be a one-parameter family of surfaces defined by a function W{x, y, z) which 
has continuous partial derivatives of at least the second order in a region F of 
j^z-space. Consider an admissible arc 

(31-2) y(x), z(x) (xi^x^xt) 

in the region F, having no comers and nowhere tangent to one of the surfaces 
(31 • 1). The equation 

W[x, y(x), z(*)] = M 

then determines x as a function xip) with a continuous derivative 

on the interval of values /i belonging to the arc (31 • 2), as one sees by the 
usual applications of implicit function theorems. The value of the integral 

/ = / 'fix, y, z, y, z')dx, 

*'*1 

taken along the arc (31-2) from the point i to a variable point defined by 
X 2 = is a function of /i which has the derivative 

(31-3) / 

' dn W^+W^y+W.z” 

At each fixed point (x, y, z) this derivative is a function of y' and z', and the 
direction 7 : y' : z' in which it has its minimum value is called by Carath6odory 
the direction of quickest descent. 

In accordance with the well-known rule for finding a minimum of a function 
of two variables y' and z', the direction of quickest descent at a point {x^ y, z) 
is determined by setting equal to zero the partial derivatives of the expression 
(31-3) with respect to y' and z'. This gives the equations 

W.+Wyy^+W,z^) U 0 , 

iW,+Wyy+W,z') // , 

which, after solving for the ratios of PT,, Wy, Wg, are found to be equivalent to 
the equations 

(31-4) / - y'A' - *7/ = W* , U » XPT, , // « \Wg , 

where X is a factor of proportionality. From the definition of transversality in 
Section 28 the truth of the following theorem is evident: 

Theorem 31 * 1. necessary and sufficient condition for an arc (31-2) to have 
the direction of quickest descent at its intersection point (x, y, z) with a surface of 
a family W(x, y,z) ^ is is that the surface be transversal to the arc at that point. 



79 


FIELDS AND THE HAMUTON-JACOBI THEORY 

Let us suppose now that for a given family of surfaces W{x, y, z) - the 
equations (31*4) have a-solution (xq, yo, 2o, yi, Zo> ^o) for which the set (xq^ yo, 
2o, yoi 2o) is admissible and gives / and the determinant — (/y'# 0* values 

different from zero. The functional determinant of the equations (31*4) with 
respect to y', s', X, after transposing the second members, is readily found to be 

- W.+Wy^ +IY,s') - (/y'-O *] . 

This is different from zero at the initial set («o, yo, So, yi, zi), on account of the 
hypotheses just made, since with the help of equations (31*4) it is seen that 

/=X(JF.+»'^y'+ir.s') 

at the initial set, and since / is different from zero by hypothesis. From the usual 
implicit function theorems it follows that there exists a region F of zyz-space 
in which the equations (31*4) have solutions 

(31*5) y, z), 2 '=j(x, y, z) , X = X (a:, y, z) 

with continuous partial derivatives of the first order at least, and in which 
the sets [x, y, z, p{x, y, z), g(jc, y, z)] are all admissible and make / different from 
zero. It will be supposed from now on that the family of surfaces (31 • 1) is a 
family simply covering a region F in which equations (31*4) have solutions 
(31*5) as just described. The solutions of the first two equations (31 *5) form a 
two-parameter family of curves in the region F, each cut transversally by the 
surfaces of the family W = fjt. They are called the curves of quickest descent of 
the family. 

Two surfaces of the family (31 • 1) are said to be geodesically equidistant if 
the values of the integral I on the segments of the curves of quickest descent 
bounded by the two surfaces are all the same. If every pair of surfaces in the 
family has this property, then equation (31*1) is said to define a family of geo- 
deskally equidistant surfaces. For such a family the following theorem can be 
proved: 

Theorem 31*2. Every family of surfaces W(x, y, z) = ^ with the properties 
described in the next to last paragraph in a region F of xyz-space is intersected 
by a two-parameter family of curves of quickest descent. A necessary and sufficient 
condition that the surfaces constitute a family of geodesically equidistant surfaces 
is that there be a representation W(x, y, z) = /a of the family for which W satisfies 
the Hamilton- Jacobi partial differential equation 

(31*6) W, + H{x, y, z, Wy, 

In that case the curves of quickest descent are all extremals. The region F is a field 
with the slope functions of these extremals and with the family W ^ as its family 
of transversal surfaces. 

The first statement of the theorem has already been justified. 

To prove the necessity of the condition in the second sentence of the theorem, 
it should first be noted that for a family of geodesically equidistant surfaces the 
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values of the integral /, taken along curves of quickest descent from a fixed 
surface W n\ol the family to a second surface IT = /i> define a function 
^(/i) with a non-vanishing derivative since this derivative is equal at 
every point to the expression (31-3). Hence, if equation (31*1) is replaced by 
ip{W) = V, the value of the derivative dl/dv is unity. Let us suppose that this 
change of representation for the family (31*1) has already been made, so that 
the value of dl/dii in equation (31*3) is unity. By multiplying the last two 
equations (31*4) by y' and s', respectively, and adding to the first, it follows 
that X = 1. If the solutions of the last two equations (31*4) for y' and s' are 
substituted in the first equation (31*4), it is found by the argument following 
equation (28 -4) that W satisfies the Hamilton-Jacobi equation (31*6). 

Conversely, if W satisfies equation (31*6), then the slope functions p and q 
defined by equations (28 *5) will satisfy equations (28 *4), which are equivalent 
to equations (31*4) with y' = p, s' == g, X = 1. The solutions of the differen- 
tial equations y' = s' = j in (31*5) are therefore the curves of quickest 
descent of the family IV = /*. On one of these curves 

as is seen with the help of equations (31-4) when X = 1. It follows readily that 
the surfaces IT = ju are geodesically equidistant. Theorem 28 - 1 now justifies 
the final statements of the theorem. 

The theory of Carath^odory thus brings us back again to the theory of fields 
and their relationship to the partial differential equation of Hamilton and Jaco- 
bi, which was discussed in detail in Section 28. 



CHAPTER IV 

PROBLEMS IN THE PLANE AND IN HIGHER SPACES 

32. Introduction. The preceding chapters of this book have been devoted to 
the simplest problems of the calculus of variations in three-dimensional space, 
the so-called “problems with fixed end-points.” That part of the theory of the 
calculus of variations has been selected as an introduction because it has all 
the essential features of corresponding problems in higher spaces and because 
its results can be easily interpreted for the analogous problem in the plane. The 
theorems and proofs in the preceding chapters have been so formulated that 
they are applicable, with only slight changes in notation and wording, to prob- 
lems in either two or higher dimensions. 

Sections 33 and 34 of the present chapter are devoted to a summary of re- 
sults for the theory of the problem in the plane. In Section 33 the theorems 
concerning necessary conditions and sufficient conditions for a minimum are 
discussed, while in Section 34 the more significant differences between problems 
in the plane and those in three-space are emphasized. 

The remaining sections of the chapter. Sections 35" 39, are devoted to the 
fixed end-point problem in spaces of higher dimensions. Instead of (x, y, z) the 
coordinates used are (x, yi, . . . , yn), and repeated use is made of the summa- 
tion convention of tensor analysis according to which a term containing a re- 
peated literal subscript stands for a sum. Thus, the expression y»Zi is a symbol 
for the sum yiZi + . . . + ynZn. The results described in Sections 35 and 36 are 
the analogues of those in Sections 33 and 34 and preceding chapters concern- 
ing necessary conditions and sufficient conditions for a minimum and concern- 
ing the determination of conjugate points. In Section 37 the criteria character- 
izing w-parameter families of extremals which simply cover fields, and in Sec- 
tion 38 the Hamilton- Jacobi theory, are summarized because of their rather 
marked notational variations from the corresponding parts of Chapters II and 
HI for the three-dimensional case. In the last section of the chapter. Section 39, 
the theorems of Section 23 concerning the second variation are generalized, and 
a more comprehensive presentation of some parts of the theory of the second 
variation is given. This will be useful in the following chapters. 

In perusing the present chapter the reader should bear in mind the fact that 
the results here described are in the nature of a summary. The details of proofs 
and theorems for the plane and higher spaces can easily be inferred from those 
in the preceding chapters, which should constantly be compared with the text 
here. The difficulties encountered in the process of making these inferences are 
only the notational ones involved in passing from three to two, or from three 
to more than three, dimensions. 
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33. The problem in the plane. The non-parametric problem of the calculus 
of variations in the plane is concerned with an integral of the form 

{33- 1) /- pfix, y, y)dx 

•'*1 


and with a class of admissible arcs 

y (x) X2) , 

on each of which the integral I has a well-defined value. The problem which 
will be treated in this section is that of finding, in a class of admissible arcs join- 
ing two fixed points I and one which minimizes the integral /. 

In order to define more explicitly a class of admissible arcs on each of which 
the integral 1 has a well-defined value, it will be supposed, as in Section 3, that 
the integrand function f(Xy y, y') has continuous derivatives up to and including 
those of the fourth order in a region R of {x, y, y')-space. Admissible sets (x, y, y') 
are those interior to the region R, and admissible arcs for the problem here con- 
sidered may be defined in a manner quite similar to the one adopted in Section 3. 
The analogues of the conditions discussed in Chapter I and their consequences, 
which may or may not hold on an admissible arc E, are described in the follow- 
ing paragraphs. 

L An admissible arc E is said to satisfy the condition I if there exists a constant 
c sttch that the equation 

(33*2) Pfydx + c 

is an identity along E. 

It is evident that, when this property holds on an admissible arc E, the fol- 
lowing consequences, analogous to those of the Corollaries 6-1, 6*2, and 6*3, 
are readily provable. On each sub-arc between comers of E the function fy* has 
a derivative, and the Euler equation 





is satisfied. The right and left limits of the function /»'[», /(x)] are equal 

at each value of x on the interval XiXj belonging to E, including those values 
of X which define comers of E. Furthermore, on every sub-arc of E between 
comers on which the derivative /,'«' is different from zero the function y(x) de- 
fining E has a continuous derivative of the nth order, provided that the inte- 
grand function /(x, y, yO has continuous partial derivatives of order n near the 
set of values (x, y, y') belonging to the sub-arc. 

In accord with the notation of Section 9 we define the £-function of Weier- 
strass by the equation 

y, y', n-/(*, y, r)-f{x, y, yO - (F'-yO A' (x, y, y'). 
Then we have the two further conditions: 
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II. An admissible arc E is said to satisfy the condition of Weierstrass if at 
every element (x, y, y') on-'E the condition E(x, y, y', Y') ^ 0 is satisfied for every 
admissible element (x, y, Y') 9 ^ (x, y, y'). 

III. An admissible arc E is said to satisfy the condition of Legendre if at each 
element (x, y, y') of E the derivative l^y is non-negative. 

To state the condition of Jacobi, we remark first that an extremal is an ad- 
missible arc defined by a function y(rc) having a continuous second derivative 
and satisfying the equation 

(33-3) + fy’yy' + U.-y - /v = ^ = 0 . 

Such an arc is said to be non-singular if the function /y'y' is different from zero 
on it. A point J on a non-singular extremal arc £12 is then said to be conjugate 
to the initial point 1 of £12 if J is a contact point of the arc £12 with the en- 
velope of a one-parameter family of extremals through the point 1. This is the 
definition of Section 10 in terms of which the fourth condition has the follow- 
ing form: 

IV. A non-singular extremal arc E 12 is said to satisfy the condition of Jacobi 
if it has on it between 1 and 2 no point 3 conjugate to 1. 

By the methods of Chapter I it can be shown that every minimizing arc £ 
must satisfy conditions I, II, and III and that a non-singular minimizing arc £is 
without corners must further satisfy condition IV, since such an arc necessarily 
satisfies the condition I and is an extremal, by the proof in Section 16. Let us 
use again the notations IIjv, HI', Illjr, IV' for the conditions II, III, IV 
strengthened in the manner described in Section 16. Then the table of neces- 
sary and of sufficient conditions for a non-singular admissible arc without cor- 
ners to be a minimizing arc, given at the beginning of Section 17, is applicable 
to the problem in the plane here considered. These results hold also for a 
maximizing arc, provided that the inequalities in the conditions II and III are 
changed in sense. 

The proof of the necessity of Jacobi's condition by means of the envelope 
theorem, based on the geometric definition of a conjugate point given above, is 
subject again to the limitations described in Section 10. Th6 analogue for the 
problem in the plane of the analytic definition of a conjugate point in Section 11 
is given in Section 34 below. 

34. A comparison of the problems in the plane and in three-space. While 
the problem in the plane described in the preceding section and the problem in 
three-space of Chapters I-III have many essential similarities, there are, 
nevertheless, some notable differences between them. In the next few para- 
graphs the divergences of the theorems for the two problems will be examined 
in some detail. 

It was shown in Theorem 7 - 1 that every non-singular extremal arc £ for the 
problem in three-space is imbedded in a four-parameter family of extremals. An 
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analogous discussion of equation (33*3) shows that for the plane problem a 
non-singular extremal arc E is imbedded for values ao, 6o in a 

two-parameter family of extremals 

(34* 1) y{x, a, h) 

with continuity properties analogous to those of the family of Theorem 7 * 1 and 
having the determinant 

ya yh 

yaat ybx 

different from zero along E. 

It should also be noted that the Legendre condition III of Section 9, in- 
terpreted for the problem in the plane, is concerned with the sign of a quadratic 
form in a single variable, so that the condition as stated above for the plane 
case involves only the sign of the single coefficient fyy of the form. 

One can easily verify the fact that for the problem in the plane the second 
variation hiri) along an admissible arc En has the form 

2 co(x, 17 , vi')dx, 

-I 

where the integrand is the quadratic form 

(34*2) 2<aiXf 71 f 17 ^) fyyTI^ "h 2 fyy* ri7l^ , 

The accessory minimum problem is that of finding in a class of admissible varia- 
tions 7i{x) vanishing at xi and X 2 one which minimizes hiTj). The Euler equation 
of this problem, 

(34*3) ^ 

is called the “accessory differential equation.” It was first used by Jacobi and 
is often called the “Jacobi equation.” 

One of the essential differences between problems in the plane and in three- 
space lies in the analytic method of determining conjugate points. By analogy 
with the definition of Section 11 we say that a point 3 on the extremal arc £12 is 
conjugate to 1 if there is a solution 7i(x) of equation (34*3) which vanishes at 
Xi and Xz but is not identically zero between these values. To establish the re- 
lationship between this analytic definition of a conjugate point and the ge- 
ometric definition given in the preceding section, we may use arguments quite 
similar to those of Section 13. Consider a one-parameter family of extremals 
y(x, a) all passing through the pomt i, containmg the arc £12 for values xi ^ 
x^ XZf a aof and having the derivative ya{x, ao) not identically zero on £ 12 . 
The function ya{x, ao) is a solution of the accessory equation (34*3) belonging 
to £ 12 , by reasoning analogous to that of Section 12; and it vanishes at xi since 
all the extremals of the family pass through the point 1. Its zeros, therefore, de- 
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termine on En the points conjugate to 1. Moreover, in terms of the two-parame- 
ter family of extremals (34- 1) the points 3 conjugate to 1 on £12 are determined 
by the zeros of the function A(«, X\^ ao, bo), where 

, . yaU, a, b) ybix, a, b) 

A (x, Xu a, b) ^ 

a, b) yb^xu a, b) 

This can be readily seen by the use of the methods of Section 12. 

In a manner analogous to that of Section 18 we say that a region F of xy-space 
with a slope function p{x, y) is a field if p{Xy y) has continuous first pjirtial deriv- 
atives in F, if the elements [x^ y, pix, y)\ defined by points (x, y) in F are all 
admissible, and if in F the Hilbert integral 

/*= f [/dac+ (dy-p dx) fy'] 

with the arguments x, y, p(x, y) in place of x, y, y' is independent of the path. 
It is readily seen, as in Section 18, that through every point of a field F there 
passes one and but one solution of the differential equation of the field 

(34-4) 5-) 

and that every solution of this equation is an extremal. The totality of extremals 
of a field may therefore be regarded as a one-parameter family. 

Perhaps the most striking difference between the problem in the plane and 
in three-space lies in the method of constructing fields. This may be clearly 
seen by contrasting Theorem 20 - 1 with the following analogue of that theorem 
for the problem in the plane: 

Theorem 34 * 1. // o one-parameter family of extremals 

(34*5) y a) [tti ^ a = a 2 > Xi Ca) ^ ^ X 2 (a) ] 

is cut by a curve C defined on the family by a function x = {(a) (ai g a ^ 02 ) and 
if the family and the intersecting curve have continuity properties like those de- 
scribed in Section 20, then every region F of the xy-plane which is simply covered 
by the extremals is a field with the slope function p(x, y) of the family, provided 
that the derivative y«(x, a) is different from zero at each set of values (x, a) corre- 
sponding to a point (x, y) in F. 

The proof of this theorem need not be given here since it is similar to that of 
Theorem 20 - 1. In view of this result, it is important to note that, in order for a 
one-parameter family of extremals having suitable continuity properties to form 
a field in a plane region F, no properties of the family are required other than 
intersection by a curve C, the simple covering of the region, and the non-van- 
ishing of the derivative ya{x, a). The reason for this is that on the intersecting 
curve C in the plane there is only one arc joming two given points, and the 
Hilbert integral is necessarily independent of the path without further assump- 
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tions; whereas on an intersecting surface (20*3) in xyz-spSLce there are infinitely 
many arcs joining two given points, and the independence property is not, in 
general, valid. Furthermore, a slope function p(x, y) with suitable continuity 
properties in a simply connected region F of the ocy-plane will make the integral 
1* independent of the path in F provided only that the solutions of the differ- 
ential equation (34 *4) are all extremals. In contrast to this, it has been seen in 
Theorem 22 • 1 that for the three-dimensional case the slope functions p{xy y, «), 
q{x^ y, z) of a field must not only be the slope functions of a two-parameter 
family of extremals, as specified in the first two equations (22 * 1), but must also 
have the additional property required by the third equation (22 1). For the 
problem in the plane there is no condition analogous to that of Theorem 22-2, 
since the family (34*5) depends upon only one arbitrary constant. Every one- 
parameter family of extremals in the plane is a Mayer family. The reader should 
examine again the statements in the next to the last paragraph of Section 22. 

In concluding this section the attention of the reader is also called to the very 
powerful and useful fundamental sufficiency theorem. Theorem 19*2, which 
with suitable simple modifications in its statement is also valid for the problem 
in the plane. 

35. The problem in a space of higher dimensions. The last fixed end-point 
problem in non-parametric form which we shall consider in this chapter is a 
generalization from the problem in three-space to a problem in space of (n + 1) 
dimensions whose points have coordinates (x, yu ^ yn). The problem to be 
considered involves an integral of the form 

f {x, yi, . . . , yn, Vi, . . . , yn) dx 

-I 

and a class of so-called ''admissible arcs” 

yiix) {xi^x^X2; 1, . . . , n) , 

on each of which the integral has a well-defined value. The problem is then that 
of finding, in the subclass of admissible arcs joining two fixed points 1 and one 
which minimizes the integral. A class of admissible arcs may be defined in a 
manner quite analogous to that of Sections 3 and 33. To simplify the theory we 
frequently use the notations 

'v * (yi, . . . , yj , y' « (yi, . . . , yn) » 
and for the fixed end-points 1 and 2 the notations 

yi * (yii, . • . > yni) > y* (yw, - . . i yn2) , 

where no confusion will be caused thereby. 

The first condition on an admissible arc £, analogous to that of Section 6, 
then takes the followmg form: 
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I. An admissible arc E is said to satisfy the condition I if there exist constants 
Ci (! == 1, . . . , n) snch that along the arc E the equations 

(35-2) fvi= f fvidx + Ci (i=l, ...,n) 

are identities in x. 

On an admissible arc E satisfying condition I the right and left limits of the 
functions /yjfac, y(x), y\x)] are equal even at corners, and on sub-arcs between 
corners they have derivatives such that 

(35-3) 

as we see by the methods of Section 6. In the notation now being used, the 
Hilbert differentiability condition, analogous to that of Corollary 6 *3, asserts 
that, if an arc E satisfies the condition I, then on every sub-arc of E on which 
the determinant 

(35-4) 

is different from zero the functions y,(jr) defining E have continuous derivatives 
of order w, provided that the integrand function /(x, y, y') has continuous par- 
tial derivatives of order m near the set of values (x, y, y') belonging to the 
sub-arc. 

For the problem in (w + l)-dimensional space here considered, the E-func- 
tion of Weierstrass, analogous to that of Section 9, is defined by the equation 

E{x, y, y', F')=/U, y, Y')-fix, y, y') “ ( F'^-y',) (x, y, yO , 

in which the last term represents a sum fori = 1, . . . , w, as was stipulated in 
a preceding paragraph. The conditions of Weierstrass and Legendre have the 
following forms: 

II. An admissible arc E is said to satisfy the condition of Weierstrass if for 
every set of values (x, y, y') belonging to E the condition E(x, y, y', Y') ^ 0 is 
satisfied for every set (x, y, Y') which is admissible and different from (x, y, y'). 

III. An admissible arc E is said to satisfy the condition of Legendre if at each 
set of values (x, y, y') belonging to E the condition 

y')ir,7rjfc^0 

is valid for all sets of constants t = (tti, . . . , Xn) such that irim = 1. 

A geometric definition of a conjugate point can be phrased, and the Jacobi 
condition formulated in terms of it, in a manner similar to that described above 
for the problems in the plane and in three-space. It should be remarked first that 
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an extremal is an admissible arc defined by functions having continuous 
second derivatives and satisfying the equations 

(35*5) fvi^ ^ (i— !,•••» w) • 

Such an arc is said to be non-singular if the determinant (35*4) is different from 
zero along it. A point 3 on a non-singular extremal arc £12 is then said to be 
conjugate to 1 if it is a contact point of £12 with the envelope of a one-parameter 
family of extremals through the initial point 1, It is evident that this definition 
is the natural generalization of those given in Sections 10 and 33 and that the 
fourth condition has the following form: 

IV. A non-singular extremal arc E 12 is said to satisfy the condition of Jacobi 
if it has on it between 1 and 2 no point 3 conjugate to 1. 

Every minimizing arc £ must satisfy the conditions I, II, HI; and a non- 
singular minimizing arc £ without comers must further satisfy the condition 
IV. For a non-singular minimizing arc without comers the theorems implied 
in the table of Section 17 are again applicable, provided that we understand 
by IIjv, Iir, Illjp, IV' the conditions II, III, IV strengthened in the manner 
described in Section 16 for the three-dimensional case. The theories of problems 
in the plane and in higher spaces are quite analogous at this stage. For a maxi- 
mizmg arc the inequalities in the conditions II and III must be changed m sense. 

36. The determination of conjugate points. For the problem in {n + 1)- 
dimensional space the application of the existence theorems for differential 
equations to equations (35*5), in analogy with the arguments of Section 7, 
shows that every non-singular extremal arc £ is imbedded for values xi^x^ 0 : 2 , 
aio, bi iio in a 2»-parameter family of extremals 

(361) y<(«, a, 6)«y.(ap, oi, ..., On, * 1 , K) 

with continuity properties like those in Theorem 7 • 1 and having the determi- 
nant 

y%xaj^ yijAj, 

different from zero on the arc £». It will be seen later in this section that the 
family (36 * 1) plays an important role in the determination of conjugate points. 

To define conjugate points on the arc £12 analytically, it should first be 
noted that the second variation hiyj) along £ 12 , for the problem in (n 4- 1)- 
space here considered, has the form 

(36*3 ) 72 (iy) - n, ri^)dx, 

where Iw is the quadratic form 

(36-4) 2«(» 1 ?, 1 ?') « + 


(36*2) 
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with the arguments rr, y{x)y y(x) belonging to En substituted in the derivatives 
of /. The accessory minimum problem along Eu is then that of finding in the 
class of admissible variations rii{x) joining the points (xi, 0 ) and (x 2 , 0 ) in xri- 
space one which makes the integral h a minimum. The accessory equations for 
the arc E 12 are by definition the differential equations 


(36-5) 




(i = 1 , . . . , ft) 


of the extremals of this accessory minimum problem. 

A point 3 on an extremal arc £12 is conjugate to the initial point 1 if there 
exists a solution rj^{x) of the accessory equations (36 • 5) with elements vanish- 
ing at Xi and X 3 but not all identically zero between these values. This analytic 
definition of a conjugate point is equivalent to the geometric definition in the 
last section above, by arguments similar to those of Section 13. 

Let the functions 

(36-6) yiix, a) =y<(a;, ai, oj 


define an «-parameter family of extremals containing the arc £12 for values 
(Xj a) satisfying conditions of the form Xi^ x ^ x^, a» = a,o, having continuity 
properties analogous to those described in Theorem 7 • 2, and passing through 
the fixed end-point 1 of £12. Each of the n sets of functions yiatSj^i ao) 
(A = 1, . . . , n) is a solution of the accessory equations (36 S). If the deter- 
minant A(Xy ao) = \yiak{x, ao)| of these solutions is not identically zero, its 
zeros determine the points 3 conjugate to 1 on £12, in analogy with the result 
stated in Corollary 12 -2. Furthermore, these conjugate points are also deter- 
mined by the zeros of the function D{x, xi, ao, 60), where D is the determinant 


(36*7) D(x, xi, a, b) 


yiak(x, a, *) 
yu»*(xi, a, b) 


yxbtix, a, b) 
yxbkixu a, b) 


formed for the family (36- 1), as is stated for the three-dimensional case in 
Corollary 12- 1. 

37. The construction of fields. In (n + l)-dimensional space a field is a 
region £ of (Xy y)-space with n slope functions pi(xy y) (f = 1, . . . , n) having 
continuous first partial derivatives in F and such that the integral 


y, p)dx+ idyi-pydx) fy^ixy y, p)] 


is independent of the path in F when the arguments pi in the integrand are the 
slope functions Pi{Xy y). The solutions of the differential equations of the field 
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form an »-parameter family of extremals which simply cover F. These are the 
easily provable generalizations for (n + l)-space of the results in Section 18. 

In Sections 20 and 22 criteria were given for determining when a region F 
simply covered by a two-parameter family of extremals will be a field with the 
slope functions of the family. The analogous results for (» + l)-space are con- 
cerned with an ^-parameter family of extremals defined by functions of the 
form 

(371) yiix, a)^y^{x, ai, ..., a„) 

with properties analogous to those of the family (20 - 1). Let us consider such a 
family with an intersecting n-space S defined by equations 

(37-2) « = f (tti, . . . , ttn) = f (a) , y,- = y» [f (a) , a] = ly*- (a) 

analogous to (20 *3). By an argument like that used in proving Theorem 20- 1, 
it follows that every region F in (x, y)-space simply covered by such a family 
of extremals will be a field with the slope functions of the family, provided that 
the determinant 1 yiai, 1 is different from zero in F and provided, further, that the 
integral 1* belonging to the family is independent of the path on the inter- 
secting space 5. This latter property means that /* is independent of the path 
in the space of the parameters (ai, . . . , an) when in the integrand of I* the 
variables Xj y are replaced by the functions (37 • 2) and the variables pt by the 
functions y»*[$(a), a]. 

If an »-space 5 without singular points and with equations of the form 

(3 7'3) X ^ ( (aij • • • 9 On) “ $ (a) $ y» “ Vi (aij • • • » cin) “ Vi (a) 

is given in advance, then an ^-parameter family of extremals intersected by S 
and having the integral 7* of the family independent of the path on S can, in 
general, be determined. To find such a family we may first solve for the variable 
Vi the n equations 

(37-4) (/-/</«'<) {«* + /»'<’?.«*= 

in which the arguments x, y, y' are to be replaced by f (a), v(a), v\ and in which 
7/(a) is an arbitrarily selected function of the variables ai, . . . , an. Implicit 
function theorems assure us that a solution i 7 '(a) of equations (37 *4) will exist 
provided that these equations have an initial solution (a, ij')o at which the 
functional determinant of their first members with respect to the variables ty' is 
different from zero. We suppose, then, that a solution (a, iy')o exists such that 
the corresponding element [{(ao), vMy Vo] is admissible, has its direction 
1: i 7 io: . . . : ^no not tangent to 5, and makes the determinant \fyivk\ different 
from zero. The functional determinant of the equations (37*4) with respect 
to the variables vi is the product of the two determinants 

1 fvWt 1 1 ^ ««jb 1 
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and is different from zero at (a, lyOo, as a result of the hypotheses just made 
above. The »-parameter-family of extremals of the form (37 • 1), determined by 
the initial elements {(a), rj(a), is then intersected by the n>space (37-3) 
for X = {(a). Furthermore, the integral I* on this w-space is the integral of dU, 
as one sees with the help of equations (37 -4), and is consequently independent 
of the path. Hence the slope functions of the family form a field in every re- 
gion F which is simply covered by the family and in which the determinant 
lytajkl is different from zero. 

In Section 22 further criteria were deduced for slope functions and families 
of extremals forming fields in jcyz-space. For the problem of the calculus of 
variations in (n + l)-space the invariant integral I* described in the preceding 
paragraphs has the form 

I*=f [A(.x, y)dx+BAx, y)dy,], 

where 

A=f-ykfv'k, y\ = p,ix, y). 


By a generalization of the argument in Section 21 the conditions 

_ dB, dB^ _ dBk 

^ ^ ^ dy, dx^ dyu dy, 

are necessary for this integral to be independent of the path in a region F and 
are sufficient if F has the property of simple connectivity analogous to that 
defined in Section 21 for a three-dimensional region. The conditions (18*5) 
have as their analogues 





as one readily verifies. It follows then without difficulty, that the equations 
analogous to equations (22 • 1) and equivalent to (37 -5) have the form 

/vj ”” /v<* ““ fviUjP i **” / vWj PjviP f) = 0 > 

t ' t ) 

fv\vk + fvWjPwk ~ fvivi fvivjpjvt == ^ > 


in which the arguments of the derivatives of /are or, y, p in place of x, y, y'. For 
every field F in (x, y)-space the slope functions px{Xy y) satisfy these partial dif- 
ferential equations. Conversely, a region F with slope functions satisfying equa- 
tions (37 * 7) is certainly a field if it is simply connected in the sense described 
above. These are the analogues of the results stated for rcyz-space in Theo- 
rem 22*1. 

For an ^-parameter family of extremals (37*1) the analogues of the expres- 
sions (22 ■ 3) have the form 


daidak ^*' duk da{ 


(/.;•) 


f37-8) 
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in which the arguments jr, y* are the functions x, a), y\Xy a) from (37 • 1) 
and the partial derivatives are taken with respect to the variables af, a. As in 
Theorem 22 -2, the expressions (37*8) are constant on every extremal of the 
family for every pair of subscripts i and k; and, if the family simply covers a 
field F, these constants are all zero. Conversely, if the values of the constants 
(37*8) on the extremals of a family (37*1) are all zero, then every simply con- 
nected region F simply covered by the extremals, and having the determinant 
lytajbl of the family different from zero in F, is a field with the slope functions 
^,(«, y) of the family. An ^-parameter family of extremals (37 • 1) whose con- 
stants (37*8) are all zero is called a Mayer family. 

Evidently a family (37 * 1) whose extremals all pass through a fixed point 
(^i> yi) is a Mayer family, since at the value X\ the derivatives dyi/dak then 
all vanish and the constants (37 * 8) are all zero. It can also be proved that a 
family (37 • 1) intersected by a single space 5 of the form (37 • 2) on which the 
integral /* of the family is independent of the path is a Mayer family. For on 
such an intersecting space S the integrand of the integral /* has the form 


J dx + J^'^idyj-yidx) = (/{., + rfo.-, 

with the arguments x, y, y' replaced by {(a), y[f(a), o], y'[£(a), a]. If the integral 
I* is independent of the path on S, the conditions 

must hold. After a rather careful computation it can be seen that the first mem- 
ber of this last equation reduces to (37 *8). Thus we see that a region F of xy- 
space simply covered by an n-parameter family of extremals will be a field with 
the slope functions of the family if the extremals of the family all pass through 
a single fixed point or if the family is intersected by a single space S of the form 
(37 • 2) on which the integral /* of the family is independent of the path. 

38. The Hamilton- Jacobi theory. The canonical variables (x, y, z) of the 
problem of the calculus of variations in (x, yi, . . . , yj-space are defined in 
terms of the original variables («, y, y') by the equations 

(38*1) = y, y'). 

We shall assume, in analogy with Section 26, that these equations establish a 
one-to-one correspondence between a region R of interior points (x, y, y'), in 
which the function / is assumed to have continuous partial derivatives of the 
third order, and a region S of points {Xy y, z). We shall assume further through- 
out this section that the functional determinant \fyiyi\ is different from zero 
in R. The equations (38 * 1) then have single-valued solutions 

yi^Piix, y, z), 


(38*2) 
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which also define the one-to-one correspondence between the points of 5 and R 
and which have continuous derivatives of order w — 1 in 5 if / has continuous 
derivatives of order m in R. 

A Hamiltonian function H(x, y, z) may be defined by the equation 
(38-3) y, P). 

Its derivatives are found, by easy calculations, to be 

/» (^j y> P^ 9 P^ 9 t^*i^Pii^9 y9 


and H has continuous derivatives of the third order when /has that property. 
Every extremal arc y,(ir) (xiS x ^ ^ 2 ) in the region R is transformed by equa- 
tions (38-1) into an arc y*W, Zi{x) {xi^ x ^ X 2 ) in S satisfying the canonical 
equations 


(38-4) 




dZi 

dx 




Vit 


and conversely. 

By the method of Section 27 we may prove the imbedding theorem for a 
non-singular extremal arc £12 in (x, y)-space by means of the existence theorems 
for differential equations applied to the canonical equations (38*4). The re- 
sults of the imbedding theorem described in Section 36 are obtained in this way 
even when the integrand function / has derivatives only of order three, instead 
of four, in the region R of admissible elements (x, y, y'). The family of extremals 
thus found is defined in the space of the canonical variables x, y, z by functions 
of the form 


(38*5) yiix, a, 6), z, (a:, a, b) , 

which define the extremal arc En for values 

Xi^ X ^ X2 f ••• 7 ®no) 9 ^0 ~ ( biOf • • • , ^no) • 

In a neighborhood of these values the functions y*, z„ y,„ Zix have continuous 
second derivatives and have their determinant 

y.«* y*6fc I 


(38-6) 


different from zero. It is easy to see that this determinant is the product of the 
determinant 

dik 0 

/w<Vfc fv{ii 


by the determinant (36 • 2), so that the non-vanishings of (36 • 2) and (38 • 6) are 
equivalent. The family may, in particular, be chosen so that the parameters a, 
b are the initial values of y, 2 at a value xo, so that 

ai « y» (xo, a b ) , 6.- « (xo, a, b) . 
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At the value xo the determinant (38 -6) is then easily seen to be the identity 
determinant. 

An M-space W(x, y) = constant is transversal to a direction 1 : y[: . . . :y!^ 
at a i)oint (x, y) if the equation 

(/ - y</vi) + fv'iiyi = 0 

is satisfied by every set of differentials dx^ dyi for which 

Since in a field F the integral 

/*=/[(/- yi-Ai) dx + A;dy.-] 

formed with the slope functions Pi{x, y) of the field is independent of the path, 
its value taken from a fixed point {x, y)o to a variable point (x, y) of the field 
is a single- valued function W(x, y) whose first derivatives are readily seen to 
have the values 

(38-7) = 

in which the arguments of / and its derivatives are x, y, p(x, y). Evidently the 
function W has also continuous second derivatives in F. From the equations 
(38*7) it follows that the ^-spaces W(x, y) = constant all cut the extremals of 
the field transversally. They are called ‘‘transversal n-spaces of the field.’* 
From equations (38-7), (38-2), and (38 1) it is evident that the slope func- 
tions of the field are the functions 

(38-8) pAx, y) =PAx, y, Wy) , 

and it follows from the definition (38 *3) of the function // that W{x, y) satisfies 
the partial differential equation 

(38-9) W^ + H{xy y, IF,) = 0 . 

This equation is called the “partial differential equation of Hamilton and Ja- 
cobi.” From what has been said above, it is evident that every field F has a 
one-parameter family of transversal «-spaces IF(x, y) = constant for which the 
function W has continuous second derivatives in F and satisfies the partial dif- 
ferential equation (38*9). One can prove, conversely, as in Section 28, that 
every solution W{Xy y) of the equation (38 9) with continuous second deriva- 
tives in a region F of (x, y)-space defines a field in F with the slope functions 
(38 • 8) . These are results for fields analogous to those of Theorem 28*1. We may 
also easily generalize to (n + l)-space the geometrical properties of a field de- 
scribed in Theorem 28 *3. 

The extremals of the problem of the calculus of variations which we are con- 
sidering can be determined by a well-known method due to Jacobi if a solution 
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W(x, y, 0) of the equation (38-9) containing »* parameters 0 ■= (jSi, . . . , /5«) 
is known. By analogy with Section 29 let W(x, y, 0) be such a solution with 
continuous partial derivatives of the second order and with a determinant 
I different from zero in a region consisting of all sets (x, y, 0) for which 
(*, y) is in a region F and 0 » (ft, . . . , ft) in a region B. Then every admissible 
arc E in the r^ion F, defined by functions 

(3810) yi(x) (*i^ * 2 ; t* 1, ...,») , 

having no comers, and satisfying the equations 
(3811) 

for particular constant values (a, /3)o> is an extremal. Furthermore, the equa- 
tions (38-11) have a 2»-parameter family of solutions 

(38-12) y.(x, a, 0) 

containing E for values x, a, 0 satisfying conditions of the form . 

Xi ^ X ^ X 2 , a = ao , 0 — 00 

and forming a 2M-parameter family of extremals with properties analogous to 
those described in Theorem 7 -1. The functions 

z. (x, o, 0) =lF,.[x, y(x, a, 0) , /3] 

with the functions (38*12) define the 2n-parameter family of extremals in the 
space S of the canonical variables {Xy y, z). These results are analogous to those 
of Section 29 for three-dimensional space. The attention of the reader is called 
to the final paragraphs of Section 29, in accordance with which the extremals 
are seen to be closely associated with the characteristic strips of the Hamilton- 
Jacobi equation (38 *9). 

In the preceding paragraph it was shown that, when a solution W{Xy y, j8) 
of the Hamilton-Jacobi equation with 5 ^ 0 is known, the extremals 

can be determined as solutions of the equations (38*11). Conversely, when 
the extremals are known, a solution W(xy y, P) of the Hamilton-Jacobi equation 
(38*9) with the properties described above can be constructed. To show this 
we construct a family of fields in a region F of xy-space with slope functions 
depending upon n parameters jS = (ft, . . . , pn)- The invariant integral /*, 
taken from a fixed point (jco, yo) to a variable point («, y) in F, then defines a 
solution W(Xy y, P) of the Hamilton-Jacobi equation with the properties desired. 

To justify this statement in full, consider an »-parameter family of extremals 
(37 * 1) cut by a space S, as in (37 *2), on which the integral I* of the family is 
independent of the path, and simply covering a bounded and closed field F. 
The value of the integral I* on S taken from a fixed point ao to a variable point a 
is a function [/(a) with derivatives defined by equations (37 *4). The equations 

(38*13) + Ua, + Pk, 
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which reduce to equations (37*4) for jSt » 0, have the initial solutions (a, /9, 
vO (ooi 0, j)o) belonging also to equations (37-4). Consequently, at this solu- 
tion the functional determinant of the first members of the equations (38-13) 
with respect to the variables ri' is the same as that of equations (37-4) and is 
therefore different from zero. Hence the equations (38-13) have solutions 
i;'(a, 0) such that 0) = vo' The extremals in zy-space determined by the 
initial values (x, y, y') — (Ko), ^(a), i}'(ai Z^)] iorm a 2»-parameter family 

(38-14) Yi(x, a, /S) 

satisfying on the initial space 5 the initial conditions 

F,-[{(a), a, j3] = i;,- (a) , F<[|(a), a, /S] = (a, /3) , 

and their consequences 

From these equations it follows that the determinant analogous to (36 - 2) for 
the family (38-14) is equal on 5 to the product 


I Vi.* - 1 1 Vtfik\‘ 

The first factor is different from zero since 5 is, by hypothesis, not tangent to 
the extremals. The second is also different from zero, since the functions ri'(a, 0) 
are solutions of equations (38-13); and the product of the determinant 
by the functional determinant of the first members of equations (38 - 13). with 
respect to the variables t)' is, therefore, the functional determinant of the sec- 
ond members of (38-13) with respect to the variables 0 which is the identity 
determinant. 

It is evident that the family Yi{x, a, 0) is identical with the family (37-1) 
and that in the repon F the equations y< = F,(*, a, 0) therefore have single- 
valued solutions a,(x, y) on which the determinant | Yiak(x, a, 0) | is different 
from zero. From implicit function theorems it follows, then, that the equations 

(38-15) y.-» F,(*, o, i5) 

have solutions a(x, y, 0) with continuous second d^vatlves and \Yiak\ ^ 0 
for values («, y, 0) sufficiently near to the values (x, y, 0) belonging to F. The 
slope functions of the famUy (38-14) in F have the values 

(38-16) Pt(x, y, 0) - Y'dx, a (*, y, 0) . 0] . 

It follows readily from equations (38-13) that on S the integral I* of the 
family (38- 14) for fixed -values 0 is independent of the path, since its integrand 
is dU + 0idat. Hence the slope functions Pi(x, y, 0) form a field in F for each 
fixed set of values 0. 
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The value of the integral /* taken from a fixed point (xo, yo) to a variable 
point {x, y) in F is a function W{x, y, ff) defined by the equation 

Wix, y, fi) « + 

with the arguments x, y, y' in the integrand replaced by x, y, pix, y, 0). Since the 
fiirst derivatives of W with respect to the variables x, y have the values 

(3817) 

it is clear that IT is a solution of the Hamilton-Jacobi equation (38 *8). 

To show that the determinant \ | is different from zero, we note first, 

from the second equation (38 - 17), that it is equal to the product l/v^yj \pkfii \ , 
the first factor of which is different from zero by hypothesis. To prove the sec- 
ond factor different from zero, we differentiate with respect to jSy the equa- 
tions (38 *15) and (38 *16), with a replaced by a(x, y, 0), The results are 

0 ^ y "f" Y ifi> , 

Pifij “ ^ *®*®*^l* • 


If there were constants Cj not all zero such that pxfijCi = 0, the last equations 
above would imply the equations 

0 = P Y 0^C j , 

0 = P i’\‘ Y 0jC i 

which are impossible, since the determinant (36 *2) for the family Pi(rc, a, 0) is 
different from zero. Hence we have proved the existence of a function W(x, y, 0) 
in terms of which the extremals are defined by equation (38 *11). 

It can be proved, by a method similar to the one described above, that, if a 
region F of xy-space is simply covered by the extremals through a fixed point 
{xq, yo) it will also be simply covered by the extremals through each neighbor- 
ing point (xo, y 0 + (3) . The invariant integral in the n-parameter family of fields 
so determined over F is also a function W(x, y, (3) effective for determining the 
extremals by means of equation (38 *11). 

39. The theory of the second variation. The theory of the second variation 
for the problem in (n + l)-dimensional space differs from that for three-space 
in notation only. Let £12 be a non-singular extremal arc for the problem in the 
space of points (x, yi, . . . , yO formulated in Section 35. 

The extremal arcs for the accessory minimum problem along £ 12 , formulated 
in Section 36, are solutions of the accessory differential equations (36 *5). For 
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two sets of variables x, % i/' and x, u, vl the mtegrand 2w of the second variation 
along £u satisfies the identity 

«» «') +»»<«»<(*» n> »»') n, n') 

because it is a homogeneous quadratic form in the variables n, i)'. If we use the 
notation ^ 

we find easily the further relation, analogous to (23 *2), 

(39*1) —UiJi(ri) u) , 

where 

(39*2) Lir, »?, >?'). 

From the relation (39- 1) it follows that for every pair of solutions i;, u of the 
accessory equations /<(iy) = 0 the expression L(iy; u) is a constant. If this con- 
stant is zero, the solutions ri, u are said to be conjugate. 

Let «»(*), Uikix) (i = 1, . . . , ») be n + 1 solutions of the accessory equa- 
tions, the last n of which are conjugate in pairs and have their determinant 
A = 1 Uik(x) 1 different from zero on the interval xi^ x^ Xi. Then the n-param- 
cter family of accessory extremals 

(39-3) Vi^Ui + akUik (t«l, ...,n) 

forms a field for the accessory minimum problem in the portion of (x, y)-space 
bounded by the hyperplanes x — xi and x * xt. This is a consequence of the 
fact that the expressions 

dm dm 

ddi ddk ’ 

analogous to (37 * 8), are the values of L(ui] Uk) formed for the conjugate pairs 
of solutions Ujif Ujk and are zero for every pair f, k. The slope functions iri(«, rj) 
of the field are found by solving the equations (39 *3) for the parameters dk 
and substituting in the expressions 

* «< + a* Uik . 

They have the values 

1 Ui Uii 

(39-4) wiix, ri)^j 

^ Uk-fJk Uki 

From this result the differences ni — n are seen to have the form 

^ yik-Uk Uki 
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We can now see, without difficulty, by the method used in proving Theorem 
23*2, that for an arbitrary admissible arc i}(x) joining the end-points of the 
accessory extremal arc «(x) the second variation has the value 

(3 9- 5) /» (i;) = U («) + jT*’ (n< - T.) (ni - irt) dx . 

This result, analogous to that stated in Theorem 23*2, is a consequence of the 
remarks in the last paragraph above and of the analogue in {x, ?y)-space of the 
integral formula of Weierstrass of Theorem 19 *1, since the £-function of Weier- 
strass for the integral h{i{) in (36 *3) is 

(39*6) £2«(«, 1?, 17', ir) = — IT,) — TT*) . 

To obtain the formula (39 *5) it was supposed that there exist n solutions 
Uxk{x) (i^ = 1, . . . , n) of the accessory equations which are conjugate in pairs 
and have their determinant ] Uik{x) 1 different from zero on the interval If 
the non-singular extremal arc £12 satisfies the condition IV', so that the interval 
3C1X2 has on it no value conjugate to X\, then there is alwa5rs such a set of n solu- 
tions, as can be seen by an interpretation for (n + l)-space of the proof of 
Theorem 23 *3. Moreover, if £12 also satisfies the condition III', the quadratic 
form (39*6) for £2® is positive definite at every value x on the interval 
xiX 2 x and it follows from formula (39 • 5) that the value h{u) of the second varia- 
tion on an accessory extremal u is less than the value of 1% on every other admis- 
sible arc yii{x) in (x, i7)-space joining the ends of w<(x). If the value x^ is conjugate 
to X\ and there is no conjugate value between xi and X2, then at least h{u) ^ 
^2(17), provided that the condition III' is satisfied. These sufficient conditions 
for the accessory minimum problem are analogues of those stated in Theorem 
23*3 and Corollary 23 *1 and are proved by methods quite like those of Sec- 
tion 23. 

A very useful result in the theory of the second variation for the problem in 
three-space is the formula (23 *10). The corresponding formula for the problem 
in (w + l)-space is equally important. If the n accessory extremals Uxk{x) 
(ft = 1, . . . , n) in equations (39*3) are conjugate in pairs and have their 
determinant | «»jb(x) | different from zero on the interval X1X2, and if the func- 
tions «,(x) define another accessory extremal, then for every set of admissible 
variations i7i(x) the formula 

(39 7) -2w(x, w, «') w') 

+ 1 ? — ir-tt') ] +2w(x, 0, i7'-ir) 


is valid, where the functions are the slope functions (39*4) of the family 
(39*3) with the arguments x, iii{x). The proof of this formula is quite like that 
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of the formula (23*10) for the three-dimensional case, though different in nota- 
tion. When Ui{x) ^ 0 (t » 1, . . . , n), the formula takes the somewhat simpler 
form 

(39-8) 2(a{x, 17 , ^ (^1 n, x) +2«(ir, 0, ly'-x). 

It should be noted that the last terms in (39 -7) and (39*8) have the values 

2(a{x, 0 , 17 ' — x) = ( 17 * — Xi) — ir*) . 

For the second variation the canonical variables x, 17 ,, f < are defined by the 
equations 

(39-9) f. = w,;.(:r, 17 , 17 '). 

Since the determinant l/y^yjkl is different from zero along the non-singular arc 
£ 12 , these linear equations in the variables i 7 « and have solutions 

(3910) i 7 i=nax, 17 , r) 

which are linear and homogeneous in the canonical variables 17 ,-, f The Hamil- 
tonian function for the second variation is then 

(3911) ft{x, V, = {•<-«(*, 17 , n). 

It is quadratic and homogeneous in the variables 77 ,*, f The canonical accessory 
equations, analogous to the equations (38-4) for the original problem, have 
the form 

09.2) 

with second members which are linear and homogeneous in 77 * and ft. Accord- 
ing to well-known theorems concerning linear differential equations, these equa- 
tions have a unique solution rji{x), f <(x) determined by each set of initial solu- 
tions ri^{xo)j f »(*o). In particular, if vii^xo) ~ tiM = 0, the corresponding solu- 
tion is Tji{x) s s 0. For two such solutions 17 ,, f , and »» the expression 

(39- 13) 

is a constant, as one verifies readily directly from equations (39 - 12) or from the 
remark following equation (39 -2). If this constant is zero, the two solutions are 
conjugate. 

A system of n linearly independent accessory extremals Uik{x)y Vik(x) (* = 1, 
...,») which are conjugate in pairs and have the determinant \uik(x)\ not 
identically zero is called a conjugate system. Such systems occur in many parts 
of the theory, some of which have already been noted above. We may represent 
by {/(a^) and 7(ap) the matrices of elements Uik{^ and Vik{ii^i » Uk^ ui), 



PROBLEMS IN THE PLANE AND IN HIGHER SPACES 101 

where Uk stands for the set (t = 1, . . . , w). From what has been said in 
the preceding paragrapTi, it follows that to every pair of matrices of initial 
values t/(a;o), V{xo) there corresponds a pair of matrices U{x)y V{x) whose col- 
umns Utk{x), Vik{x) (ft = 1, . . . , n) form n solutions of the canonical accessory 
equations (39-12). These n solutions will form a conjugate system if, and only 
if, the product of the transposed matrix U{xo) by V{xq) is symmetric. This fol- 
lows because the elements of the matrix Fl/ — UV are the constants (39 -13) 
belonging to the pairs of solutions va {k 1, ...,»). In particular, we 
secure conjugate systems if we take U(xo) = I, V{xq) = 0, or U(xo) = 0, 
V{xq) = /, where 1 is the identity matrix. In the former case the matrix U{x) 
will certainly have its determinant \U(x)\ = \uik{x) j different from zero near 
X = Xq. We shall see in Chapter VI that the possibility of constructing a con- 
jugate system with determinant different from zero on an interval including a 
given value xq has important consequences. 



CHAPTER V 

PROBLEMS IN PARAMETRIC FORM 

40. Introduction. For the problems of the calculus of variations discussed 
in preceding chapters, the arcs considered have all been taken in non-parametric 
form. In the space of points (oc, 3 ^ 1 , ... , yn), for example, they were defined by 
single-valued functions of the form yi{x) (xi ^ x ^ X 2 ] i ^ ly • • . t n). This is 
in contrast to parametric arcs which have defining single-valued functions of the 
form x(t), yi{i) h) i ^ Evidently every non-parametric 

arc has a parametric representation which may be secured by setting x ** t; 
but a parametric arc is representable non-parametrically only when the func- 
tion «(/) is properly monotonic. Since the coordinates yi, . . . , yn all play 
quite equivalent roles in the parametric representation of an arc, it is customary 
to formulate parametric problems of the calculus of variations in the space of 
points (yi, . . . , yn) in place of points (x, yi, . . . , yn)- Frequently, when con- 
venient, we shall, as before, use the notations y and y' for the sets 

y “ (>^11 • • • y yn) , / = (/l, • . . , yn) • 

A function of parametric arcs y»(/) (/i ^ ^ f = 1, . . . , ») can have 
geometric significance only if its value on each arc is the same for all parametric 
representations of the arc. The integrals to be minimized in parametric problems 
of the calculus of variations are therefore supposed to have values independent 
of the parametric representations of the arcs along which they are taken. It 
turns out, in Section 41 below, that their integrands are functions /(y, y') which 
are positively homogeneous of order one in the derivatives y[ = dyi/dt. The 
presence of this homogeneity property complicates the theory considerably, 
and its consequences appear frequently. The more important formulas are 
listed for convenient reference in Section 43 below. 

There are many problems of the calculus of variations for which a satisfac- 
tory discussion can be made only if they are taken in parametric form. A simple 
example of a problem of this sort is that of finding in the class of arcs joining 
two fixed points one which has a minimum length. It should not be inferred, 
however, that the theory of problems in parametric form is more important 
than the theory of non-parametric problems, or vice versa. Each has its appli- 
cations in the places adapted to it. The variables for a problem in dynamics 
concerning n bodies, for example, are the time / and the 3n coordinates {xiy yi, z<) 
(f « 1, . . . , «) of the n bodies of the S 3 rstem under consideration. The move- 
ment of the system is described by 3n functions Xi{t), yi{t), Zi{t) (f * 1, . . . , n) 
in which t increases monotonically and which are extremals of a Hamiltonian 
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integral analogous to that of Theorem 30 - 1 for the problem of one body. In the 
problem with n bodies ‘the roles of the variables /, Xi, yi, Zi are analogous to 
those of Xy yi, ^ yn in the preceding chapter, and the problem is evidently 
essentially non-parametric. 

Many of the results desirable for parametric problems are deducible imme- 
diately from corresponding theorems for the non-parametric case. The places 
where new arguments seem to be necessary are in the discussions of the ex- 
tremals, of the analogue of the Jacobi condition, and of the sufficiency theorems. 
The methods adopted here for the treatment of these and other parts of the 
theory are the ones which, at the time of writing, seemed most convenient. In 
the last section of this chapter. Section 54, a sketch of other possible theories 
is given with some comments on their applicability and relative convenience. 

41. Parametric representations of arcs. The arcs to be considered will be 
defined in (yi, . . . , y„)-space by functions of the form 

(4M) yi(t) (/ig/g/ 2 ; i- 1, . . . , n). 

Such an arc will be called regular if the fun'^tions y»(/) have continuous deriva- 
tives yl on the interval hh and if the sum yiyi is everywhere different from zero. 
Thus, a regular arc has no singular points. In the following pages derivatives 
with respect to the parameter t will be designated sometimes by primes and 
sometimes by subscripts, and a repeated index will always indicate a sum, as 
is customary in tensor analysis. 

The parameter may be changed by a substitution of the form / = ^(r), where 
V>(t) is continuous and has a continuous positive derivative on an interval n ^ 
r ^ 72 and takes the end-values h = ^(ri), fe = ^( 72 ). The transformations of 
parameter used in the following sections of this chapter will all be of this sort. 
In particular, a transformation with a negative derivative ^'( 7 ) will not be per- 
mitted. Thus, all of the arcs considered are directed arcs, each with a positive 
sense determined by the direction in which the parameter increases along 
the arc. 

In the ^y-space the integral of the calculus of variations would normally have 
the form 

(41-2) f 'f{ty y, y')dt. 

But such an integral will, in general, change value when the parameter is 
changed; and the value of such an integral therefore cannot, in general, repre- 
sent a number associated with an arc which has a purely geometric significance 
independent of the particular parameter chosen to represent the arc. The 
length of arc, for example, is a number quite independent of the parametric 
representation. Thus, if we are to have a theory effective for geometric problems, 
we must impose upon the integrand function /(/, y, y') such properties as will 
make the value of the integral independent of the particular parameter used. 
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If we apply to the arc (41 • 1) the simple parametric transformation / = t + c 
the values of the integral (41 ■ 2) in terms of the parameters t and r will be 

(41-3) f fit, y, yt)dt, f y, yrldr, 

where the derivatives of y are represented by subscripts and the limits corre- 
spond by the transformation / = r + c. If these values are equal, then by 
differentiating with respect to t we see that at each point of the arc 

/(^ y, yd = fir, y, yr). 

But, since yi = yr, we have 

fit, y, yd = fit- c, y, y,) , 

and this equation must be true for all values c. It follows readily that, if the in- 
tegral (41 • 2) is to be independent of the parametric representation for all arcs 
(41 • 1), the integrand function must not contain the parameter t explicitly. 

A further property of the integrand function can be determined by the use 
of the transformation / = xt in which k is supposed to be a positive constant. 
In terms of the parameters / and r the integral (41 • 2) has again the values 
(41-3), in which t and t are now supposed not to occur explicitly in /. If the 
integrals are equal, differentiation with respect to r and use of the relation 
yr = gives the equation 

/(y, ydK^fiy, Kyt) (k>0). 

Theorem 41 1. Necessary and sufficient conditions for an integral {41 '2) to 
have values independent of the parametric representation on arcs of the type {41 • 1) 
are that the integrand function f is positively homogeneous in the variables y' and 
does not contain the parameter t explicitly. 

The sufficiency of the conditions, for parametric transformations / = <p{t) 
of the type described above, follows readily from the fact that the integrals 
(41-3) are equal at the initial value r = n where t = /i, and from the easily 
established equality of their derivatives with respect to r. In the following sec- 
tions the integrals (41 • 2) to be considered are all independent of the parametric 
representation. 

42. Formulation of the parametric problem. In view of the preliminary 
remarks made in the last section, we shall consider in this chapter an integral 
of the form 

(4 21) / = /*V(y, y)dt, 

taken along arcs defined parametrically in the n-dimensional space of points 
y *= (yu • • • > yn) by functions 

(42-2) yiU) 


(/l ^ ^ ^ l2f t*®l| •••! ft) • 
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We shall suppose that there is an open region R of 2n-dimensional points (y, y') 
with the properties: 

a) R contains only points (y, y') for which yiyi 0; and if a point (y, y') is 
in i?, so are the so-called related points (y, icy') for all values ic > 0; 

b) f{y, y') is continuous and has continuous partial derivatives up to and 
including those of order four in R; 

/(y> y) for all points (y, y') in R and all values k > 0. 

An admissible element (y, y') is one which is interior to the region R. An 
admissible arc (42 • 2) is one which is continuous and consists of a finite number 
of regular sub-arcs and which is, further, such that all of its elements (y, y') 
are admissible. 

The problem is then to find in the class of admissible arcs joining two given 
fixed points 1 and 2 in y-space one which minimizes the integral /. 

For this chapter on problems of the calculus of variations in parametric 
form the following definitions will be found useful. The set of points (y, y') 
belonging to an admissible arc E includes, by definition, not only the set of points 
Ly(0i yil)] (^1 ^ ^ belonging to a particular parametric representation of 

the arc but also all of the related points [yd), icy'(/)] (#c > 0). It is evident that 
the set contains all of the points of every parametric representation of the arc. 

The symbol iV will frequently be used to designate a neighborhood of the set 
of points (y, y') on an admissible arc £. Such a neighborhood is defined to be a 
set of points (y, y') which, if it contains a point (y, y'), contains all the related 
points (y, xy') (#c > 0), and which, furthermore, contains in its interior all the 
points (y, y') belonging to the arc E, 

If a set S of points (y, y') contains all of its related points, then for every 
point interior to S all of the related points are also interior to 5. This follows 
because the 5-neighborhood of (y, <cy') is interior to S whenever the €-neighbor- 
hood of (y, y') is in 5, provided that 5 is less than € and €k. Hence, to construct 
a neighborhood N of an arc E one may take an €-neighborhood of a particular 
parametric representation of E and extend it to contain all of its related points. 

A normed set (y, y') is, by definition, one for which y/y' = 1. When the 
length of arc 

f \yWi) ^^^dt 

along an arc E is taken as parameter, then every set (y, y.) belonging to that 
particular parametric representation of E is normed. 

43. Consequences of the homogeneity relation. There are a number of conse- 
quences of the homogeneity property of the integrand function /(y, y') which 
are essential in the developments of the following pages and which can be de- 
duced with little difficulty. It will simplify the theory if they are proved in 
advance. Typical properties of this sort are given in the list below. Others which 
will be needed are omitted because they can be derived from those in the list by 
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very simple aiguments. The functions Tj, E, Q, 2ci>, are the analogues of 
others wUch have been used in the preceding chapters, and are defined by the 
first equations in which they occur. In particular, the equations <= 0 are the 
Euler equations of the problem, and their solutions are extremals. The tensor 
analysis notation for a sum is used freely here, as in the preceding chapter and 
in later sections. The list of homogeneity properties to be considered is as 
follows: 

(431) /(y, xyO =K/(y, y') (*>0). 

(43-2) y'j,i=/. 

(43-3) /,,(y, Ky')^Kf,,(y, y') , U^(y, Ky')=/,i(y. y') 

fy'mi (y. Ky') *= (y. y') • 

(43 4) r. = , y\T, S 0 in y, y'. y" . 

(43-5) £(y, y' Y')=fiy, Y')-f(y, y') - (Yi- y'i) f,',(y, y') 

= fiy, Y') - Y'ify'^iy, y') 

= Y'AMiy, Y') -/,i(y, y')]. 

(43-6) Qiy, y'\ v, f) =/»',,i(y, y') Vitk 

Qiy> y'i py’> f) = o, 

G(y, y'; v + py', f) =6(y, y'; n, f). 

(43’7) 2(i; (/, If, ifO “ */»■*?* “h ^ 

y<"ft- (if) = —ViTi = 0 along an extremal , 
d 

Ji (py') =» ^( pTj) = 0 for all p (/) along an extremal . 

TTie proofs of these relations are simple. The first equation (43-2) follows 
from the homogeneity property (43-1) by differentiating with respect to k and 
setting X *« 1. The second equation (43 • 2) follows from the first by differentiat- 
ing with respect to y*. Equations (43 • 3) are deduced from (43 • 1) by differentia- 
tion with respect to the variables y, y'. To prove (43-4) we have 

/trjtg^y* "I" /«{i4y* “ /*< I 
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from which the relation y'tTi = 0 follows with the help of the relations (43-2). 
The same relations justify the last two forms for the £-function and the for- 
mulas in (43-6) m which the symbol /»y' stands for the set (pyi, /yy^. To 
prove the third equation in (43 • 7) we have 

y'iJi ^ ^ (/<«»;) - yiun'i - /.«»,■ , 

/»i»* j ~ f y'pijfVk 1 

and from equations easily deducible from (43 - 2) it follows that 

yWi= Wv*. 


^ (vkfti) + if'kTk . 


The desired equation follows at once from these results. The last equation in 
the set (43 • 7) is a consequence of the fact that when in = p(t)yi we have 

dpfy. Pdfy’f 


It is evident from equations (43 -2) that the determinant |/»'iyil of the bi- 
linear form Q is equal to zero. The determinant 


(43-8) 


R = 


fy'ptk 

P 

yk 


/ 

yi 

0 


is, however, not always zero; and an admissible arc on which it remains different 
from zero is called a non-singular arc for our problem. 

Theorem iZl, Alan admissible element (y, y') the following three statements 
are equivalent: (/) the determinant {43 '8) is different from zero; {2) the matrix 
llfyiVkll Aos rank n — 1; and (J) the matrix l|fyiVky.11 has rank n. 

We may prove this by showing that (1) implies (2), (2) implies (3), and (3) 
implies (1). If the determmant (43 -8) is different from zero, the matrix of its 
first n rows must have rank », and this cannot be true if the matrix in (2) has 
rank less than » — 1. Hence statement (1) implies statement (2). If the matrix 
in (2) has rank » — 1, then the equations whose coefficients are the first n col- 
umns of the matrix in (3) have, from (43 *2), the solutions py' and only these. 
But the set py' does not satisfy the linear equation whose coefficients are the 
last column of the matrix in (3) unless p = 0, and hence this last matrix must 
be of rank ». Consequently, statement (2) implies statement (3). Finally, sup- 
pose that the matrix in (3) is of rank n. The only solutions of the linear equa- 
tions whose coefficients are its rows have the form py', 0; and these do not 
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satisfy the linear equation whose coefficients are the last row of the determi- 
nant (43-8). Hence this determinant is different from zero. We see, therefore, 
that (3) implies (1), and the theorem is proved. 


44. First necessary conditions for a minimum. The results which have 
been explained in preceding pages for non-parametric problems in the space of 
points {x, yi, . . . , yn) justify at once the formulation of the following condi- 
tions in the parametric case provided that we replace the independent variable 
X by i. One can easily see, with the help of the formulas of Section 43 above, that 
these conditions are satisfied for all parametric representations of an arc E if 
they are satisfied for one. 

I. An admissible arc E is said to satisfy the condition I if there exists a set of 
constants ci such that the equations 

fy'i-f + 

hold along E. 

Corollary 44 • 1 . 0« every sub-arc of an admissible arc E without corners which 
satisfies condition 1 the functions iy[ have derivatives and 




Corollary 44*2. At every value t defining a corner of an admissible arc E 
which satisfies condition 1 the right and left limits of the function fyf[y(t), y'(t) 
are equal. 

The third corollary, analogous to that in Sections 6 and 35, with its proof, 
must be modified slightly for the parametric case, as follows: 

Corollary 4A ' 3. If on a sub-arc without corners of an admissible arc E which 
satisfies the condition 1 the determinant {43 8) is different from zero^ then the func- 
tions yi(t) defining E have continmus mth derivatives, provided that the integrand 
function f(y, y') has continuous mth derivatives in a neighborhood of the elements 
(Xi yO on the sub-arc in question, and provided also that the parameter i is the 
length of arc. 

To prove this we may consider the n + 1 equations 

,,, .X «] y' , 

(44- 1) 

«*«*- 1 = 0 , 

which are satisfied, according to the condition I, by the functions 
/ a 0, provided that / is the length of arc on E. Along this solution the func- 
tional determinant of the first members of equations (44- 1) with respect to the 
variables Uu, I is twice the determinant (43-8) defined above. Furthermore, the 
first members of the equations (44-1) have continuous derivatives with respect 
to the variables t, ut, 1. Hence the usual theorems concerning implicit functions 
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show that the solutions Uk = >>*(/), / = 0 have continuous derivatives near 
every value of / defining a point between corners of the arc E at which the de- 
terminant (43*8) is different from zero. The argument is similar to that of 
Section 6. Reapplying the implicit function theorems, we see, as in that section, 
that the functions yi{t) defining the minimizing arc in terms of its length of arc 
as parameter must have continuous wth derivatives near a value t at which 
the determinant (43 *8) is different from zero, when the function / has con- 
tinuous tnXYi derivatives. An argument similar to that of Section 6 shows the 
existence of forward or backward derivatives of the wth order near a value t 
defining a comer element (y, y') of E at which the determinant (43 *8) is differ- 
ent from zero. 

The conditions analogous to those of Weierstrass and Legendre are also im- 
mediate analogues of the conditions stated for the non-parametric case. If they 
are to be strengthened for the sufficiency proofs by leaving out equality signs, 
they must be modified slightly, however, as indicated in the following state- 
ments: 

II. An admissible arc E is said to satisfy the condition 11 of Weierstrass for 
the parametric problem if at every element (y. y') of E the inequality 

Eiy, y', ¥') ^0 

holds for every admissible element (y, Y') with Y' 5 *^ icy' (ic > 0). 

III. An admissible arc E is said to satisfy the condition 111 of Legendre for 
the parametric problem if at every element (y, y') of E the condition 




hMs for every set w with v ^ py'. 

We can never exclude the equality sign in these conditions without the re- 
strictions F' icy', TT ^ py'. For from the formulas (43*3) and (43-5) it fol- 
lows that the value of the £-function in condition II vanishes when F' = icy'. 
Similarly, the formulas (43-6) show that the value of Q in condition III van- 
ishes when T = py', the factor p being either positive or negative. 

Theorem 44* 1. Every minimizing arc for the parametric problem with fixed 
end-points, formulated in Section 42, must satisfy the conditions I, II, III. 

The conclusion of this theorem is an immediate consequence of the proofs 
of the similar result for the problem in {x, yi, . . . , yn)“Space if we change the 
independent variable x to /. 

Lemma 44 * 1. If an admissible arc E satisfies the condition III', then there is a 
neighborhood N of the elements (y, y') on E such that for every element (y, y') in N 
the condition 111' is also satisfied. 

For the proof of the lemma we can see first that the condition III' is equiva- 
lent to saying that the quadratic form 




iyiUy 

(3,'y') 8/2 


(44*2) 
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is positive definite at each element (y, /) on E. This form is evidently positive 
when the variables are not all zero and are proportional to the variables yi, 
as one sees with the help of the second formula (43 ■ 2). If III' is satisfied, it is 
also positive for all other non-vanishing sets ft. Conversely, if the form (44*2) 
is positive definite, then the condition III' must hold, since for each set w 9 ^ p/ 
there exists a non-vanishmg set — pyi such that yt^i ■» 0 and, further- 

more, such that the form (44 ■ 2) is equal toQ(y,y-,ir,T), on account of the third 
formula (43 ’fi). 

If the form (44-2) is positive definite for all sets (y, y') on the arc E, then it 
remains positive definite for all sets (y, y'} in a sufficiently small neighborhood 
of those on a particular parametric representation of E. The extension of that 
neighborhood to contain all its related sets is a neighborhood N of the t}q>e 
described in Section 42. It has the property that the condition III' is satisfied 
for all sets (y, y') in N. 

CosotLASY M - 4. If at each dement (y, y') of an admissible arc E the condi- 
tion HI' is satisfied, then the inequality 

Eiy, y', F') >0 

holds for all in-dimensional sets (y, y', Y') hasting Y' 9 ^ sy' {k> 0) and lying 
in a properly chosen neighborhood No of the sets (y, y', Y') = [y(t), y'(t), y'(t)] 
belonging to E. 

To prove this we use the formula 


(44-3) 


2£(y, y', F') = 

( f; - y'i) ( f; - yl) fy'^, [y, y' + « ( F' - y') ] 


( 0 < < 1 ) , 


analogous to the expansion (9 - 3) for the non-parametric problem in three-space, 
and consider the set S of points [y(t), y'(<)] (^i ^ t ^ <t) belonging to a particu- 
lar parametric representation of the arc E. There will be a 2c-neighborhood of 
the set 5 in the neighborhood N of Lemma 44-1 where III' holds. When a 
3»-dimensional point (y, y', F') is in the c-neighborhood of the points 
b(^)> y0)> y(l)] belonging to E, the correspondmg 2»-dimensional point 
[y', y' -f-0(y' — y')] will lie in the 2c-neighborhood of S and will therefore also 
lie in iV as can easily be verified. The differences F<' — yl can not be propor- 
tional to the values y<' -f- ®(F<' — yO when « is sufficiently small, as will be 
shown in the following paragraph; and the expression (44-3) is therefore posi- 
tive. Hence the c-neighborhood described above, extended to contain all of its 
related sets (y, xy', XF') (x > 0, X > 0), has the properties demanded in the 
corollary for the neighborhood iVo. 

If the differences F{ — yl were proportional to the values y/ -f- fl(F{' — yi), 
we should also have Yi pyi for some p. For the points (y, y', F') in the 
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«-neighborhood described above the sums y/y,- all have a positive minimum m, 
and absolute value | F,' — y[\ < e. Hence we should have 

Ip- 1| l/.l <«. 

which would imply that p > 0 when € is sufficiently small. But this would con- 
tradict the hypothesis F' 9 ^ k/ (k > 0) of the lemma. 

Lemma 44-2. If the condition III holds at an admissible element (y, y'), then 
a necessary and sufficient condition that II V holds is that the determinant 

/vivi yx 

yk 0 

shall be different from zero, or^ in other ivords, that the element (y, y') be non- 
singular. 

The condition is necessary since, if the determinant were equal to zero, there 
would be constants Tri, I not all zero satisfying the linear equations 

(44-4) + y>.= 0. 

On account of the second relation (43 -2) it would follow that / = 0, so that the 
values wk would not all be zero. Furthermore, the set tt would not be proportion- 
al to the set y' because of the last equation (44*4). But from the first of these 
equations it would then follow lhat ff.ylTiTk = 0, which shows that the condi- 
tion III' cannot hold when the determinant is zero. 

The condition is sufficient, since if the determinant (44-4) is different from 
zero it follows, by arguments like those used in proving Theorem 43 • 1, that the 
same is true of the determinant of the quadratic form (44-2). This non-nega- 
tive form cannot, therefore, take its minimum value zero at a set of values 
f 5 *^ 0 and must be positive definite. 

45. The extremals. In this section we shall consider the definition of an 
extremal and the analogue for parametric problems of the imbedding theorems 
of Sections 7 and 36. An extremal arc is an admissible arc without corners for 
which there are defining functions y,(/) which have continuous second deriva- 
tives and which satisfy the P^uler differential equations 

(451) ~ fy\yky^ fvWky^ “■ ^ • 

A non-singular extremal arc is one along which the determinant R in (43 • 8) is 
different from zero. Such an arc is imbedded in a {2n — 2)-parameter family 
of extremals, as indicated in the following theorem: 
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Theorem 45 *1. The Imbedding Theorem. Through a given non-singular 
admissible element (y, yO (17, 17') there passes one and but one extremal arc. 
Furthermore^ every non-singular extremal arc Eo is a member for parameter values 

^10 ^t^ /20 I ar^ GrO I “ ^rO (^~lf 2 j . , . ^ W“~l) 

of a (2n — 2)-parameter family of extremals 

(45*2) y% (/j Gi| • • • 9 Gn— 1 | 5i9 • • • ^n—l) “ 3^1 

/t?f wAtcA the functions yi, yit continuous partial derivatives of at least the 
second order in a neighborhood of the values (t, a, b) belonging to Eo and for 
which the determinant 

yi 0 

// / 

yi yi 

is different from zero along E. 

To prove this theorem we may use a device which seems artificial. It is sug- 
gested by, but is somewhat different from, the procedure followed in the study 
of the parametric problem as a problem of Lagrange, a type of problem which 
will be considered in some detail in a later chapter. Let us consider the equations 

(45-4) - /vi + ^ /< = 0 . y'kyk = 0 , 

the last of which insures a parametric representation for which the sum is 
a constant. These equations are linear in the variables y", /; and the determi- 
nant of coefficients of these variables is the determinant R in (43 *8), which is, 
by definition, different from zero at every non-singular element (y, y'). Hence 
the equations (45*4) have solutions 

(45*5) y*' = A*(y, y') , / = L(y, y'), 

whose second members have continuous partial derivatives of at least the sec- 
ond order in a neighborhood of a non-singular admissible element (y, y') = 
(17, 17') or in an ^-neighborhood of the set of elements (y, y') belonging to a par- 
ticular parametric representation of a non-singular extremal arc £, since the 
integrand / has continuous derivatives of the fourth order. The function L is 
identically zero, since from the equations (43*4) and the first equation (45*4) 
it follows that y/r< -f iy/y< = Ly/y< = 0 . Consequently, every solution of the 
first n equations (45*5) will satisfy the equations (45*4) with / s 0 and will 
be an extremal arc with y/y< = constant along it. 

There is one and but one extremal arc through a non-singular admissible 
initial element ( 17 , ri'). This follows because the equations (45*5) have one and 
but one solution yi(/) through a given non-singular element (<,y,y') == ( 0 , 17 , 17 ') 
and because every extremal arc yM through the initial element ( 17 , 17 ') is repre- 


(45*3) 


y.«r y*br 

y'iar y'th. 
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sentable, after a change of parameter, by the unique functions y<(/) which satisfy 
the equations (45 • 5) and pass through the element {t, y, y') = (0, ly, tj'). If n is 
the value defining the ^element (iy, on the extremal yi(r), the parametric 
transformation needed to justify this statement is 

= f hiryir) . 

Let us suppose now that the length of arc is the parameter / in terms of which 
the non-smgular extremal arc £o of the theorem is expressed, so that the equa- 
tions y/y< = 1 and ylyl* = 0 are identities along Eo. The existence theorems for 
differential equations tell us that through each element (/, y, y') = (t, 17, 17') 
in a neighborhood of the elements of this type belonging to £0 there passes a 
solution of the first n equations (45 -5) defined by equations of the form 

(45*6) yt = V!>tO, T, 17, »?')• 

According to the existence theorems, the functions tpu have continuous par- 
tial derivatives of at least the second order in a neighborhood of the sets (/, t, 
17, 17') belonging to Eo, since the functions -4*(y, y') in equations (45 *5) have 
such derivatives. 

The curves of the family (45 *6) depend upon 2 n + 1 parameters r, 17, 17' but 
their projections in y-space form a (2n — 2)-parameter family, as we shall see 
in a later paragraph. To define this ( 2 n — 2 )-parameter family (45 - 2 ) with 
the properties described in the theorem, let (to, 170, be a particular fixed ele- 
ment on Eo, with 770 standing for the set (7710, . . . , 77 no) and with a similar in- 
terpretation for 77o. Let 77^(01, . . . , (Zn-i) (i = 1, ...,») be » functions of 
« — 1 parameters ar (r = 1, . . . , « — 1) having continuous derivatives of at 
least the second order near a set = (dio, . . . , «n-i, 0) at which 77 ,(flo) = i7<o, 
and such that the determinant 

(45*7) liia^iao) Viol 

is different from zero. Furthermore, let tiiibi, • • . , K-i) be n functions of » — 1 
parameters Jr (r = 1, . . . , w — 1) having continuous derivatives of the sec- 
ond order near a set Jo = (Jioi . • . , *n-i. 0) at which riKbo) = 77/0, satisfying 
the equations 77/17,' = 1 and having their matrix |l77/6X^o)ll of rank » — 1. We 
shall prove that the family 

(45*8) yi-fpilh To, ri(a), 77' (J) ] = y,(/, a, J) 
has the properties of the theorem. 

To prove this we see, first of all, that the family (45 • 8) contains the arc Eo 
for values (/, a, J) satisfying the conditions /lo S ^ feo, a = oo, J = Jo, where 
ho and ho are the parameter values defining the ends of Eo. Furthermore, it 
follows readily from the preceding paragraph that the functions yi(/, a, J) and 
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their derivatives have the contmuity properties described in the theorem. Be- 
cause of the identities 

Vi^Viir, T, V> = T, V> v') 

and their easily deduced consequences 

Sik = T, V, v') > 0 = <?<«*(»■, T, V, v') > 

0 = Vivi(r, T, ij, 1?'), = T, ij, »>0i 

one sees that at the values (/, a, b) = (to, oo, to) the determinant (45 • 3) has 
the value 

>j.«p (aO 0 Vio 0 

0 yut ViO 

By moving the next to last column over the » — 1 preceding ones, it follows 
that, except for sign, this determinant is the product of (45 *7) by 

\lib^(bo) i7<o|. 

This last determinant is also different from zero. Otherwise there would be a set 
of constants Cr, d, not all zero, such that 

CrVtbj. (W +dl7,o= 0- 

If these equations are multiplied by i7,'o and added, we find d = 0, since differ- 
entiation of the identity = 1 implies that rjimbr = 0. But this would 

imply also Cr = 0 (r = 1, . . . , » — 1), since the matrix |l7?<6r(fto)|l is, by hy- 
pothesis, of rank n — 1. Hence the determinant (45 *3) for the family (45 -8) 
is different from zero at the point defined by / = to on the arc E, We shall see 
in connection with the proof of a later corollary (Corol. 48 1) that this de- 
terminant is either everywhere equal to zero or everywhere different from zero 
on the arc E. Thus the theorem is proved. 

When the functions (45 *2) are substituted in equations (45 -5), it is evident 
that the second derivatives ym also have continuous partial derivatives of the 
second order. Furthermore, if the integrand function has continuous wth deriva- 
tives, then the functions Akiyyy') in equations (45 *5) have continuous partial 
derivatives of order w — 2, and the same is true of the functions y,-, yu, yut' 
On a preceding page it was stated that in y-space the family of curves (45 • 8) 
contains all the arcs of the family (45 -6). This can be seen, at least for values 
(t, 1?, 17') sufficiently near to (to, i?o, tJo)> by changing the parameter t in (45 *8) 
to p + qi and noting that the 2n constants a, ft, p, q can then be determined by 
implicit function theory so that the initial values at / = t of the functions 
(45 • 8) and their derivatives with respect to t are q and 17'. 
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Theorem 45 • 2. Tie extremals through a fixed point ^ on a non^'Singular extremal 
arc Eo constitute an (n — Xyparameter family which may he represented hy func- 
tions of the form 

(45‘9) yi (/, ail • • • > ttn— i) = yx (/, a) . 

The family contains Eq for values (t, a) satisfying conditions of the form 

/lO = ^ = ^20 > ttr “ ttrO (^~1» •••> ^““1)» 

and the extremals of the family pass through the point 0 for a fixed parameter 
value t = To the interval tiotao* The functions yi, y,t, ynt have continuous par- 
tial derivatives of at least the second order in a neighborhood of the values (t, a) be- 
longing to Eo, and the matrix 

y\-r yi 

is of rank n + \ at every point of Eo. 

The extremals of the theorem are those of the family (45 -8) for which 
Or = arQ and with the notations ar = K. The matrix (45 10) consists of the 
last » + 1 columns of the determinant (45 • 3) for the family (45 • 8) and hence 
must be of rank n + 1 along JEo, by the preceding theorem. 

We may determine an (n — l)-parameter family of extremals through a 
point 0 on Eo by a second method. Let us suppose that a (2n — 2)-parameter 
family of extremals (45 • 2) with the properties described in Theorem 45 • 1 has 
been found by any method of integrating the Euler differential equations. The 
equations 

(45-11) y»o = y<(A), a, b) 

have as a particular solution the values (^o, o, b) = (to, ao, 6o), where to is the 
value of / determining the point 0 on Eq, At this solution one determinant, at 
least, of the matrix 

II y.-., y.-6, y.dl 

is different from zero, since the determinant (45 -3) is different from zero. This 
determinant can be chosen among those including the last column, since the 
elements of that column cannot all be zero. Hence the n equations (45 • 11) have 
solutions for to and » — 1 of the variables a, b in terms of « — 1 others of the 
variables a, 6, which may be denoted by ai, . . . , an~i. These equations there- 
fore determine solutions /o(a), <Jr(o), br(a) reducing to to, tfro, bro for ttf — flrO 
(r * 1, . . . , n — 1) and having continuous partial derivatives of at least the 
second order near a = ao. The matrix of derivatives of the functions flr(a), 
br{a) is of rank w — 1, since n — 1 of these functions are identical with the 
variables ar. The arcs defined by the functions 

yi [t, (a) , b (a) 1 = y, (/, a) 


(45-10) 
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pass through the pomt 0 for the parameter values / » /o(a) and have the con- 
tinuity properties described in the theorem, except that the constant to is re- 
placed by the function /o(a)* The matrix (45 *10) for the family so determined 
is the product of the matrices 




9 

9 





0 0 
0 0 
1 0 
0 1 


in which r and s both have the range 1, . . . , »— 1. The product matrix is of 
rank n + 1| since the last two matrices have, respectively, the ranks 2n and 
w “h !• 

A final remark concerning the matrix (45 * 3) of Theorem 45 * 1 may be of 
interest. The family of extremals 


yi^P + qU h) = Yiity h, Py q) 


obtained from the family (45 * 2) by replacing thy p + qt contains 2n constants. 
The determinant 


(45*12) 


Yihr 

Y'ia, F;.6, 


Yip 

r tp 


is the analogue for this family of the determinants (7 * 6) and (36 * 2), which are 
different from zero along extremal arcs of the families (7 *5) and (36 * 1), respec- 
tively, in the non-parametric cases. By very simple calculations it can be shown 
that the last determinant is equal to the determinant (45 • 3) multiplied by 

46. The envelope theorem and Jacobi’s condition. In the space of n-dimen- 
sional points y a one-parameter family of extremals through a point 1 can be 
represented by functions of the form 

(46* 1) y, (/, 0) [h W^t^ h {P), Po ] , 

in which /3 is a single parameter and h{p)y tiiP) are functions having continuous 
derivatives on an interval Pi^ P ^ Po, We assume that the functions y,*, yu, yut 
have continuous partial derivatives of at least the second order in a neighbor- 
hood of the set of values (/, p) satisfying the conditions in the square brackets 
and that, at least along the arc of the family corresponding to the parameter 
value poy the derivatives yifi are not everywhere zero or proportional to the 
derivatives yj. This last assumption is made in order to be sure that the arcs 
of the family for values p near Po are not identical with the arc corresponding 
to Po, 

For the parameter values ti(P) the arcs (46 *1) are all supposed to pass 
through the point 1 with coordinates (yu, . . . , y^i), so that the equations 

(46*2) y««yiU(l8), P] iPi^p^Po) 
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are identities in p. The arc D defined by the equations 

(46-3) y, = y. [t, (0) , p] = F. (p) {pi^P^ Po) 

will be an .envelope of the extremals of the family (46 - 1), provided that equa- 
tions of the form 

(46-4) Y\{p) =K{p)y\[U{p), p] 

hold, where k{P) is positive on the interval Pi ^ P < Pq. On that interval the 
arc D has no singular point, since the derivatives y ' never vanish simultaneously, 
but it may have a singularity at = jSo if i^iPo) = 0. 

It is easy to see that the auxiliary theorems of Section 8, extended to the 
problem in (x:, yi, . . . , yn)-space, are applicable to one-parameter families of 
extremals for the parametric problem when we change the name of the inde- 
pendent variable from x to t. If in Figure 10 1 we change the number 6 to 3 
and designate the arc of the family (46*1) which corresponds to Po by Eu, 
then, since the arc C of Corollary 8 * 1 is now the point 7, we have the equation 

I (En) -/(Eu) =7* (774.0. 

In the invariant integral 1* for the parametric case the coefficient / — of dt 
is zero on account of the first formula (43 2). With the help of the homogeneity 
property (43-3) of the derivatives fy\ and (43 2) again, we find fy\{Y,y') = 
/,;(F, F') and 

7* (7743) = Pfy^,(¥, n Y[{p)dp 

= rV(F, Y')dp^HD,,). 

Hence we have the following theorem: 

Theorem 46 * 1. The Envelope Theorem. For a one-par ameter family of 
extremals {46 -I) through a fixed point 1, having an envelope I) as shown in Figure 
10' 1 with 6 replaced by 3, the equation 

I {El,) =7(Ei4) + 7 ( 7 ) 43 ) 

holds for every point 4 sufficiently near to 3 on D. 

If we define a point conjugate to the point 1 on an extremal arc E 12 as a point 
of contact 3 of E 12 with an envelope 7) of a one-parameter family of extremals 
through the point 7, such as is described above, then we may formulate a prop- 
erty for non-singular extremals as follows: 

IV. A non-singular extremal arc E 12 is said to satisfy the condition IV of Jacobi 
if it has on it between 1 and 2 no point 3 conjugate to 1. 

By a method similar to that of Section 10 for the non-parametric case we 
can now prove the following theorem: 

Theorem 46 *2. Every non-singular minimizing arc E without corners must 
satisfy the condition IV of Jacobi. 
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A non-singular minimizing arc without corners is always an extremal, by 
Theorem 44 - 1 and Corollary 44 -3 to the condition I. By the envelope theorem 
the value of the integral 1 along the composite arc £14 + + £32, in Figure 

10 ’ 1 with 6 replaced by J, is the same as that along £12. This composite arc is 
not an extremal, since by our hypothesis it is not identical with £12 for small 
values of and since £12 itself is the only extremal arc through the element 
(y, y') belonging to £12 at the point 3. Hence, in every neighborhood of the 
composite arc there are other arcs with the same end-points giving the inte- 
gral 1 a smaller value than that given by the composite arc, and the same is 
clearly true of £12 itself. 

47. Analytic proof of the condition of Jacobi. A. non-singular minimizing 
arc £12 without comers for the parametric problem is necessarily an extremal, 
as was stated in the last paragraph. The second variation along such an arc has 
the form 

(47-1) 72(1?) = /‘*2a,(i7, 

where 2cj is the quadratic form (43 * 7). It can be proved, as in Section 4 for the 
non-parametric case, that for a minimizing arc the second variation h{yi) must 
be non-negative in the class of admissible sets >;»(/) for which = t 7<(/2) = 0. 

The accessory minimum problem is that of minimizing 72(77) in the class of 
admissible variations with elements vanishing at h and k. The differential 
equations of the accessory extremals are the accessory equations 

(47-2) /.(U) = 

In the discussion of the Jacobi condition for parametric problems it turns out 
that the solutions of these equations which satisfy the relation y[tji = 0 
play an important auxiliary role. Such solutions are called normal solutions of 
the accessory equations. 

Definition of a Conjugate Point. On a non-singular extremal arc Ei2 a 
point 3 is said to be conjugate to the initial point 1 if there exist constants pi, p2 
and a solution of the accessory equations with functions 77,(0 ^ot everywhere pro- 
portional to the functions yl{i) on the interval i\U but such that 

7?i (/i) = piy,- (/i) , 77, (/a) = Pay,- (/a) . 

This definition is entirely equivalent to saying that there exists a normal 
solution Ui{t) of the accessory equations vanishing at t\ and /a but not identically 
zero on the interval Wa. The functions «,(/) = 77,(0 piOyliO with p = 
yivklylyk are, in fact, such a normal solution, since the equations (47 -2) are 
linear, and the fourth equation (43 7) shows that the set py[ is also a solution. 

If we use this definition in place of the geometric definition of a conjugate 
point in Section 46, we can again prove Theorem 46 -2, which says that on a 
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non-singular minimizing arc £12 without comers there can be no point 3 con- 
jugate to 1 between 1 and 2, To establish this result, suppose that there were a 
conjugate point 3 between 1 and 2, and consider the variations r\i(t) defined by 
the conditions 

•ni (0 = Ut (/) (/i ^ ^ /a) 

i?t(d==0 

where «*(/) is a normal solution not identically zero on the interval hh but van- 
ishing at /i and /a. The functions tj* so defined would give to the second variation 
the value zero but would not minimize it, as one sees by an argument like that 
of Sectional!, supplemented by the following additional remark. The corner 
condition a),;.(/3 — 0) = + 0) and the relations yiui s 0, Ui{h) = 0 

would imply in the present case that at / = /a 

21 =3'Itwifc=0. 

These equations, however, could not be satisfied, since by Theorem 43 • 1 they 
would require the derivatives Uk to vanish at / = /a, as well as the functions w*. 
This is not possible if the functions Ui definng the normal solution are not all 
identically zero, as will be seen in the first paragraph of Section 48 below. 

48. The determination of conjugate points. Every normal solution of the 
accessory equations for a non-singular extremal arc £12 satisfies with X = 0 the 
equations 

(48- 1) +Xy; = 0, (yU0" = O, 

which are linear and homogeneous in the variables r/j, t;/, t;/', X. These equations 
can be solved for the variables X, since the determinant of the coefficients 
of the 17/' and X is the determinant (43 *8). On account of the identity yiJi(rf) = 
0, from (43 *7), the solution X is identically zero. The solutions for have the 
form 

(48-2) V, v') , 

in which the functions At are linear and homogeneous in rjt and 17/ with coeffi- 
cients functions of / having continuous derivatives of at least' the first order on 
the interval /ife. Every solution rji(t) of equations (48 • 2) is a solution with X = 0 
of equations (48*1), and conversely. From well-known theorems concerning 
linear differential equations a solution of equations (48*2) is identically 
zero if it vanishes with its first derivatives at a single point to. This is in par- 
ticular true of a normal solution of the accessory equations (47 -2). 

Lemma 48 * 1. For every set (c = 1, . . . , 2n — 2) consisting of 2n — 2 
solutions of the accessory equations the determinant 

T7ia V, 0 

(48-3) d{t)= , 

yi yi 
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is either identically zero or everywhere different from zero. When d(t) ^ 0 the 
system ifte is called a fundamental system of solutions of the accessory equations. 

To justify the lemma we note first, with the help of the fourth equation 
(43-7), that when * l/(y.'y0 the sets <pyl and tvy'i form two independent 
solutions of the equations (48- 1) and therefore also of (48-2). For «• = 1, . . . , 
2« — 2 the functions Uu, = rn, — p^yl with p« = ymiuhiyi form 2» — 2 nor- 
mal solutions of the accessory equations, and all the columns of the determinant 


(48-4) 


Ui, <pyi t<pyi 

Uic (ipyi) {tipyi) 


are therefore the elements and their derivatives of solutions of the equations 
(48*2). From well-known theorems concerning linear differential equations 
such a determinant is either identically zero or nowhere zero. By easy reduc- 
tions the determinant (48 -4) is seen to have the value which proves 

the lemma. 

Lemma 48 -2. If the set ruder =* 1, . . . , 2n — 2) solutions of the accessory 
equations is a fundamental system^ then the determinant 


has the form 


Vitr (0 yi it) 0 

h) = 

Viti (h) 0 yi (h) 

D(t, /i) = /i), 


with X(ti, ti) = ±d(ti), and hence is diferent from zero near the value ti. If the 
set riig is not a fundamental system, then D(t, ti) vanishes identically. 

The determinant D{t, /i) may be written in the form 

(/) - (/i) yi it) - yi iti) - yi (h) 

D it, /i) * , , . 

(/i) yi iti) yi ih) 

After applying Taylor’s formula with integral form of remainder and removing 
a factor it — the remaining factor X(/, /i) has the properties described 
in the lemma. 

If the set vii, is not a fundamental system, the determinants (48-3) and 
(48*4) vanish identically. There is then a set of constants d, e, not all zero, 
satisfying the 2n linear equations whose coefficients are the rows of (48*4) at 
t » h. The n functions 

it) +dip it) yi it) + etip it) yiit) (i « 1, . . . , n) 

vanish identically, since they vanish with their derivatives at / » and form a 
solution of equations (48 *2). The determinant 


UiAt) ^ it) Vi it) tip it) yiit) 
Uia ih) *p ih) y ih) hip ih) yi (O 


48*5) 
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therefore vanishes identically, and so does Z?(/, h), since (48 S) has the value 
(/i - /) ^(0 D{t, h). 

Theorem 48*1. 7/ 2n - 2 solutions i;i,(t) (cr = 1, . . . , 2n - 2) the ac- 
cessory equations for a non-singular extremal arc have a determinant D(t, ti) 
not identically zero, then each root U of the equation J){U, u) == 0 on the interval 
ti ^ 1 ^ I 2 determines a point 3 conjugate to 1 on E^, and conversely. 

If tz is a root of the equation oltz, ti) = 0, the linear equations 

«tcr (/a) Ca + <p (/a) yi (tz) d + tz<p (tz) y'i (tz) 0 = 0, 

(/i) Ce + <p (/i) y'i (/i) d + tup (ti) yi (h) 0 = 0 

have solutions Cc, d, c, with d = 0 = 0, since yluia = 0 for every a, and with 
constants c„ not all zero. The solution Ui = UwCa of equations (48 *2) is then 
normal and vanishes at h and tz and cannot be identically zero on the interval 
htz since D{t, h) lA 0 near t\, by Lemma 48 *2. Hence the point J on E 12 is 
conjugate to 1, 

Conversely, if «<(/) is a normal solution of the accessory equations determin- 
ing a point 3 conjugate to 1 on £ 12 , then there are constants d, e, not ail 
zero such that 

Ui^UicC^ + (py\d+tipy\e , 

since every solution of the equations (48*2) is linearly expressible in terms of 
the 2n solutions uur, <pylj tipyl, which are linearly independent because the de- 
terminant D(ty h) is, by hypothesis, not identically zero. From Ui{ti) = Ui{tz) = 
0 it follows that the determinant (48*5) and hence D(ty ti) are zero at / = tz- 
Theorem 48 *2. If the solutions riir(t) (r = 1, . . . , n — 1) 0 / /Ae accessory 
equations along a non-singular extremal arc E 12 have a determinant 

A(t) ^\vir(t) yUt) I 

not identically zero and with rank one a/ 1 = ti, then each root ta of A(t) on the 
interval ti < t ^ t 2 determines a point 3 conjugate to 1 on E 12 , and conversely. 
The determinant 

(48*6) \u,r (t-h)^yi\ 

is easily seen to be equal to (/ — /i)^A(/). Its columns are n solutions of the equa- 
tions (48- 2), all of whose elements vanish at / = since the solutions Uir are nor- 
mal and have elements proportional to those of the set yl at / = /i. If the determi- 
nant of the derivatives of these n solutions is different from zero at / = t\, 
then by applications of Taylor’s formula with integral remainder term it follows 
that the determinant (48-6) is the product of (t — ti)^ by a factor different 
from zero at / = /i. On the other hand, if the determinant of derivatives vanishes 
at / = ti, there will be a linear combination of the columns of (48*6) with con- 
stant coefficients not all zero which vanishes with its derivatives at < = /i 
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and is therefore identically zero. Thus we see that A(t) either has the form 

0 - A)’*~VW with mOO p^Oor else is identically zero. 

Suppose, now, that A(i) is not identically zero and that /j is one of its roots. 
The equations 

Cr + y<«*)d = 0 

have solutions Cr, d not all zero, but with rf = 0 since the solutions «»> are nor- 
mal. The normal solution Ui — UirCr then vanishes at h and h and is not identi- 
cally zero, since A(/) ^ 0. Hence /a determines a point conjugate to 1. 

Conversely, let Ui(t) be a normal solution defining a point 3 conjugate to 

1 on jBi 2 . It is a solution of equations (48 -2) with elements vanishing at t\ and 
therefore expressible in terms of the columns of (48 -6) in the form 

Ui (/) = Uir (/) Cr “{“ (/ fl) ip (/) yi (/) d , 

as one sees by an argument like that used in the proof of Theorem 12 *3. The 
constant d is zero, since the solutions Mi and «,> are all normal. The constants Cr 
are not all zero, since the normal solution Ui is not identically zero. Since the 
values Ui{t^ are all zero, it follows readily that h is a root of A(/). 

Corollary 48 - 1. Let E 12 be a non-singular extremal art contained for 

/lO ^ ^ ^ ^20 I Of — > br = ^rO •••> U — 1) 

in a (2n — lyparameter family of extremals 
(48-7) y,(/, a, b) 

with continuity properties as described in Theorem 45 • 1 and such that the determi- 
nant 

D (,t, tij o, b) ~ , 

yia, ih) y,», (/i) y< {h) 

is not identically zero along E 12 . Then the points 3 conjugate to 1 on E 12 are deter- 
mined by the zeros ts ^ tio of D(t, tio, ao, bo). 

This follows from Theorem 48-1, since by the argument of Section 12, the 
2» — 2 sets yian y^r are solutions of the accessory equations. 

In the proof of Theorem 45 1 it was stated that the determinant (45 *3) is 
either identically zero or everywhere different from zero. This is a consequence 
of the statement in the preceding paragraph and of Lemma 48*1. 

It is interesting to note that except for a factor (/i — the determi- 
nant D of Corollary 48 * 1 is the determinant of the derivatives of the functions 
yi(p + 09 <^9 *)> yiip + 019 <^9 b) with respect to the 2n constants <ir, 6r, q, as 
one can see by simple transformations. This fact emphasizes the analogy be- 
tween the determinant of Corollary 48 *1 and the corresponding determinants 
of Corollary 12*1 and (36*7) for the non-parametric theory. This remark sup- 
plements the comment concerning the determinant (45*3) made in the last 
paragraph of Section 45. 
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The determinant D of the corollary is certainly not identically zero when 
the family (48-7) has its determinant (45-3) different from zero along £i*, 
since then the solutions y*, of the accessory equations are Unearly inde- 
pendent and we have seen in Lemma 48 • 2 that D is not identically zero neat 
t = h. 

Corollary 48*2. Let E12 be a non-singular extremal arc contained for 

/lo = / S ^20 , ttr = ttrO ( r = 1 1) 
in an (n — l)-paramcter family of extremals 
(48-8) yAt, a) 

all passing through the point 1, with continuity properties as described in Theorem 
45 • 2 and such that the determinant 

A (/, a) = I yia^ y\ | 

is not identically zero along E 12 . Then the points 3 conjugate to 1 on E 12 are deter- 
mined by the zeros U ^ tio of A(t, ao). 

If the extremals of the family (48*8) all pass through the point 1 for the 
parameter values /i(a), where /i(a) has a continuous first derivative near a = 
tto, then from the equations 

yti = y»[^i(a) , a] 

it is found by differentiation that the rank of A(/, a) is unity at t = i\. Hence 
the corollary follows from Theorem 48*2. 

LsMiiCA 4S ’ 3. If Ei 2 is a non-singular extremal arc having on it no point 3 con- 
jugate to 1, then there is always a point 0 on the extension of E 12 in the order 012 
which also has no conjugate on E 12 . 

We have X(/, /i) 0 on /i < / ^ fe in the formula of I-rCmma 48 -2, since £12 

contains no point conjugate to 1 and since X(/i, h) 7 ^ 0. It follows that X(/, /©) 
remains different from zero for /i ^ /b if ^0 is taken less than t\ and sufficiently 
near to h. For such values of /o the determinant D{t, to) is different from zero 
on tih. The lemma now follows from Theorem 48 - 1. 

The following theorem shows that the conjugate points defined in Section 46 
are also conjugate points as defined in Section 47 ; 

Theorem 48 • 3. On a non-singular extremal arc E 12 every contact point 3 of an 
envelope of a one-parameter family of extremals through the point 1, as described in 
Section 46, is a point conjugate to 1 on E 12 in the sense defined in Section 47, 

To prove this, consider the one-parameter family of extremal arcs (46 1). 
The derivatives jSo) of this family define a solution of the accessory equa- 
tions along the arc E 12 . From equations (46 • 2), (46 • 3), and (46 • 4) it is seen that 
this solution satisfies the conditions 

yi [ti (0o) > Po] h (Po) +yx0 [/i (Po) > ^ 0 ] = 0, 

yi [^4 (po) I /3o] [^4 (Po) ^ (Po) ] *t" y%0 [^4 (Po) 9 ^o] ~ 0 



124 


LECTURES ON THE CALCULUS OF VARIATIONS 


and hence has elements proportional to those of the set yl at /i(i8o) and h » 
U(Po)» Its elements are not everywhere proportional to those of the set y/, 
however, as one sees by reference to the hypothesis in the first paragraph of 
Section 46. Thus the contact point 3 of the arc £12 with the envelope Z> 43 , as 
shown in Figure 10 * 1 with 6 replaced by J, is a conjugate point of the kind de- 
fined in Section 47. 

4Q. Fields and a fundamental sufficiency theorem. For parametric problems 
the notion of a field is analogous to that described for the non-parametric prob- 
lem in Section 18, but the homogeneity of the integrand function / is the cause 
of some modifications in the theory which will appear in the following para- 
graphs. 

Definition of a Field. A field is a region F of y-space containing only in- 
terior points and having a set of slope functions pi{yiy . . . , yn) with the follow- 
ing properties: 

a) The functions pi(y) have continuous partial derivatives of at least the 
first order in F and are normed so that piPi = 1; 

b) The sets [y, ^(y)] corresponding to F are all admissible; 

c) The integral 

(491) I*^fMly.Piy)]dyi 

is independent of the path in F in the sense that it has the same value on all 
continuous arcs in F consisting of a finite number of regular sub-arcs and hav- 
ing the same two end-points. 

The assumption that the functions pi{y) are normed is convenient for the 
sequel but not necessary. On account of the second homogeneity property 
(43 *3), the functions pi{y) in the integrand of /* could be replaced by any set 
Kpi {k > 0) without changing the value of the integrand. 

In the definition (49* 1) of the invariant integral no term in dt occurs. This is 
because, by analogy with the non-parametric problem, the coefficient of dt 
would be / — ylfy'i, which vanishes identically by (43 -2). 

For the functions 

Aiiy) P (y) 1 

we find, with the help of the homogeneity properties of the derivatives of /, the 
relations 

~ ~ » 

analogous to the relations (18-5) and (37-6). 

When i* is independent of the path, the differences 3i4</dyt — di4*/3y< all 
vanish, and the last equation shows that the solutions of the differential equa- 
tions 
(49-2) 
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are all extremals. They are called the extremals of the field. They form an 
(« — l)-parameter family, since one and but one of them passes through 
each point of F. Furthermore, on every extremal arc Ea of the field /*(£«) = 
HEii), from the definition (49-1) of I* and the property (43 -2) of /. 

If a continuous arc yi(T) in F consists of a finite number of regular sub-arcs 
and is a solution of the equations 

(49-3) U>0), 

it will necessarily be an extremal arc of the field. For in terms of the new 
parameter 

l=f K{r)dT, 

the arc will satisfy the equation (49 *2), as one readily verifies. 

Theorem 49 *1. The Formula of Weierstrass. If En is an extremal arc of 
a field F and if C 12 is an admissible arc in F joining the ends of E 12 , then 

HCn) -HEn) = r^Ely, p (y) , y]dt, 

where the variables y in the integrand are to be replaced by the functions yi(t) de- 
fining C 12 . 

The proof of this theorem is the same as that of the analogous Theorem 19* 1 
for non-parametric problems. 

Theorem 49-2. A Fundamental Sufficiency Theorem. If E 12 is an ex- 
tremal of a field F in which the condition 

E[yj p (y) , y'] > 0 

holds for every admissible set (y, y') with yi in F and y,' ^ ^pi (x > 0), then the 
inequality I(Ci 2 ) > I(Ei 2 ) is true for every admissible arc C 12 in F which is not 
coincident with E 12 . 

To prove the theorem we may suppose that the parameter t on C 12 is the 
length of arc. From Theorem 49 - 1 it follows that /(C 12 ) > /(JSw) unless y/ = 
Kpi (k > 0) at every point of C 12 . But in that case we would have ic = 1, since 
y<y< == 1 and pipi = 1, and hence C 12 would be an extremal of the field. This is 
impossible unless C 12 coincides with £ 12 , since £12 is the only extremal of F 
through the point 1. 

As in the non-parametric case, there are a number of methods for the con- 
struction of the so-called Mayer families of extremals whose slope functions form 
fields in every region F of y-space which they simply cover. Such families are 
defined by equations 

(49-4) y» = y»(^ ai, On-i) «y<(^ a) [(/, a)inr] 

containing » — 1 parameters. The functions y<, yu are supposed to have con- 
tinuous partial derivatives of at least the second order for values (/, a) m the 
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region T. A family of this sort is said to simply cover a region F of y-space if 
the equations (49 -4) have a unique solution [/(y), a(y)] in T for each point y in F. 
The functions 

(49* S) pi (y) = y\ [t (y) , a (y) ] 

are called the slope functions of the family in F. If the determinant A(/, a) = 
|y,«, y[\ is different from zero at the values (/, a) corresponding to points in F, 
then by the usual implicit function theorems it follows that the functions /(y), 
a(y) have continuous derivatives of at least the second order in F. 

The slope functions of a family (49*4) will not always form a field in a region 
F which is simply covered by the family. By methods analogous to those of 
Sections 20 and 37 it can be shown that a family (49 *4) will be a Mayer family 
and form a field in every region which it simply covers if it is intersected by a 
single (« — l)-space S on which the integral of the family, from (49- 1), is 
independent of the path. The {n — l)-space S is supposed to be defined by a 
function t *= r(a) substituted in the equations (49 *4). Special cases are again 
those for which S is a fixed point, or an (« — l)-space which cuts the extremals 
of the family transversally so that 1* has the value zero on every arc on S. 

To construct a Mayer family in most general fashion, let us start from a 
(2» — 2) -parameter family of extremals containing a particular extremal arc £, 
as described in the imbedding theorem. Theorem 45 • 1. Let us intersect E by an 
arbitrarily selected (» — l)-space S not tangent to £, defined by functions 
Fi(ai, . . . , ttn-i). The point of intersection of 2 and E is supposed to be de- 
fined on 2 by values aro near which the functions Yi{a) have continuous second 
derivatives, and on E by a parameter value /o. At those values the non-tangency 
of 2 and E is expressed by the fact that the determinant | Y^r yi 1 is different 
from zero. 

To determine an in — 1) -parameter family of extremals intersected by 2 and 
having an integral /* independent of the path on 2, we determine Or, &r, T as 
functions of the parameters a by means of the equations 

(49*6) y,(r, a, b) « (a) , /,;[F, y (T, a, b) ] Ua,. 

Here l/(a) is an arbitrarily selected function having continuous third deriva- 
tives near the values aro and having first derivatives which satisfy the second 
set of equations (49 -6) at the special values (a, a, b, T) = (oo, ao, 6o, /o) defin- 
ing the intersection of 2 and £. The functional determinant of the first members 
of equations (49 *6) with respect to the variables Cr, 6r, T is not zero at this ini- 
tial solution. It is, in fact, the product of 

bik 0 F.-., 

® fy^Yia^ = — y< 

0 0 
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by the non-vanishing determinant (45 *3), divided by The product dis- 

played above is not zero, since 2 is not tangent to E and since E is non-singular. 
Hence the equations (49*6) have solutions <ir{a), br{a), T{a) with continuous 
derivatives of at least the second order near the values aro and such that ar(ao) = 
flro, Jr(ao) = bro, T{ao) = to. If we substitute the functions ar(a), br(a) in the 
functions (45-2), a family of the form (49-4) is obtained which is intersected 
by the surface S for / = T(a), The integral 7* of the family is independent of 
the path on 2, since from the second set of equations (49*6) its integrand has 
on the space 2 the value 

fvi^y% “ fvi ^ ^ dU • 

Hence the family of the form (49 -4) so constructed is a Mayer family and has 
slope functions defining a field in every region F which it simply covers. 

50. Sufficient conditions for relative minima. The conditions I-IV which 
appear in the sufficiency theorems below for relative minima were described 
in Sections 44 and 46. As in preceding chapters, we shall use the symbols II' 
and III' to denote the conditions II and III strengthened to exclude the equality 
signs, and 11n to denote the condition II extended to hold in a neighborhood N 
of the elements (y, y') on an admissible arc E, The symbol IV' will denote the 
condition IV of Section 46 with the added restriction that the point 2 is not 
conjugate to the point 1 on £ 12 . The definition of conjugate point to be used is 
that of Section 47. With these definitions agreed upon, we may state the fol- 
lowing lemma: 

Lemma 50 - 1. 7/ a non-singidar extremal arc E 12 not intersecting itself has on it 
no point conjugate to its initial point 1, then there exists a field F of which E 12 is 
an extremal. 

To prove this, we note first that by Lemma 48 ■ 3 there is a point 0 on the 
extension of £12 in the order 012 such that 0 has no conjugate on £ 12 . Ac- 
cording to Corollary 48 *2, this insures the non-vanishing on £12 of the deter- 
minant 

A(/, o) = |y<., yil 

belonging to the (» — l)-paranieter family of extremals (48 -8) passing through 
the point 0 and containing Ea. Without loss of generality we may assume that 
the parameter t in the equations of this family is the length of arc measured 
from the point 0. The equations 

(50- 1) y. = y. (/, o) 

have as initial solutions the points (y, /, o) belonging to £ 12 . This set of points 
contains no two distinct points with the same projection y, since £ 12 , by hy- 
pothesis, does not intersect itself; and, furthermore, A(/, o) 0 on this set, 
as we have just seen. Hence, from a well-known implicit function theorem, the- 
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equations (50-1) have solutions i(y), a,(y) with continuous partial derivatives 
of at least the second order in a neighborhood F of the points y on Ea and such 
that Oriy) ttro on £12. The functions 

Pi (y) = yu [< (y) , a (y) ] 

are a normed set of slope functions of the family (50-1) in F. For every con- 
tinuous arc Du in F consisting of a finite number of regular sub-arcs and for 
the extremals £os> £04 joining the point 0 to the ends of Du, we have the relation 

/(£,4) -/(£„) =/*(/), 4). 

This is a consequence of the extension of Corollary 8 ■ 1 to parametric problems, 
as explained and used in the third paragraph of Section 46. Hence 7* has the 
same value on all arcs D in 7^ joining the same two points 3 and 4 and is inde- 



pendent of the path in the sense described in the definition of a field in Sec- 
tion 49. The region F with the slope functions Piiy) is, therefore, a field whose 
extremals are the extremals (50-1). 

Theorem 50- 1. Sxjeficient Conditions por a Weak Relative Minimum. 
If an admissible arc Eu withotU comers and not intersecting itself satisfies the 
conditions 7, 777', IV, then there is a neighborhood H of ihe dements (y, y') on 
Eis such that the rdation I(Cu) > I(Eu) holds for every admissible arc C12 in N 
which is not coincident with Eu. 

To prove this we note first that the arc £u is a non-singular extremal, by 
Lemma 44-2 and Corollary 44-3. Hence by Lemma 50-1 it is an extremal of 
a field F in y-space with slope functions pfy). The hypothesis of Corollary 44-4 
is satisfied by £u because of the property IH' of £12. 

To select a neighborhood N with the properties of the theorem, we may con- 
sider the set S of points [y(0, y'(01 (^1 ^ ^ ^ O on the arc £12 with the param- 
eter t as length of arc. An c-neighborhood S, of the set S can be selected with 
e so small that all of the points [y, p(y), y'] corresponding to points of 3$ are 
in the neighborhood No of Corollary 44-4 in which the £-function remains 
nmi-i^gative. The set 3, extraded to contain all of its related points is then a 
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neighborhood N with the properties specified in Theorem 50 - 1, as one may see 
with the help of the fundamental sufficiency theorem, Theorem 49*2, if it is 
agreed that the admissible arcs of this latter theorem shall be those whose ele- 
ments (y, y') lie in the neighborhood N. 

In order to prove for parametric problems a sufficiency theorem for a strong 
relative minimum analogous to Theorem 16 *1, we may make use of Lemma 
16 *1, which says that on a non-singular extremal arc £12 the condition IIjv 
implies Iljy. The proof of this lemma needs the following emendations in order 
to be effective for the parametric case. As in the non-parametric case, we con- 
sider a neighborhood N of the arc £12 in which IIjv holds and the determinant 
(43 • 8) is different from zero. We further infer that, if £(y, y', F') were zero for 
a normed set (y, y', F') with (y, y') in N and F' 9 ^ y\ then for these values 
y, F' considered as fixed, the function £(y, y', F') would be minimized by the 
values y'. Hence with the help of Theorem 43 • 1 and the second formula (43 * 2) 
we should have F' = — xy' {k > 0), since 

^ tfviVk ^ ® » 

and also, with the help of the homogeneity properties of / and the second order 
conditions for a minimum of £ at y', 

I = yifviUUkVi^k'^l ~ Jvhyx^k'^ 1=0 

for all sets ir. But this last inequality is impossible if N is suitably chosen, since 
II implies III as in the non-parametric case, since the non-singularity of £12 
and III imply III' by Lemma 44 • 2, and since III' along £12 implies III' at 
each element of a neighborhood N of £12 by Lemma 44 *1. 

We may now prove for the parametric case the following analogue of Theo- 
rem 16*2: 

Theorem 50 *2. Sufficient Conditions for a Strong Relative Minimum. 
If an admissible arc E 12 without corners and not intersecting itself is non-singular 
and satisfies the conditions /, /7n, ZF', then there is a neighborhood F of the points 
y on Ei 2 such that the inequality I(Ci 2 ) > I(Ei 2 ) holds for every admissible arc C 12 
in F which is not coincident with E 12 . 

The proof is like that of Theorem 50 * 1 provided that we select the neighbor- 
hood F so small that for points y in F the sets [y, ^(y)] are all in a neighborhood 
N where the condition II^ holds. 

It is evident that these theorems establish for the parametric problem in 
n-space, and for admissible arcs without comers and not intersecting them- 
selves, the results implied in the first two rows of the table in Section 17. 
In the next section below we shall see that the third row, and a quite different 
sufficiency theorem for a strong relative minimum, can be justified for para- 
metric problems for which the region R of admissible elements (y, y') has suit- 
able properties. 
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51. Further sufElcient conditions for strong relative minima. The first 
theorem to be proved in this section is effective for a parametric problem for 
which the region R of admissible elements consists of ail sets (y, y') for which y 
is in a region Ry of 3 ;-space containing only interior points with y<y< 9 ^ 0. For 
such problems it turns out that the conditions I, II', III', IV' are sufficient for 
a strong relative minimum. 

In order to deduce this result, we shall need some further properties of the 
£-function. Let pi, qi be two directions normed and orthogonal to each other, 
and let ai(S), ai{B) represent the directions 

ai {$) * pi cos 0 + Qi sin 6 , a*- (^) = — pi sin d + g, cos 0 (0 ^ 0 ^ x) , 

which are also normed and orthogonal. In the following paragraphs we shall 
denote these directions frequently simply by a and a. We have, first of all, the 
following lemma: 

Lemma 51 • 1. If at a particular point y the sets (y, y') = [y, a(0)] (0 ^ ^ ^ x) 
are all admissible, then for every o) on the interval 0 ^ w ^ x 

(511) = j^sin(«-e)0[y, a; a, a\de 

= (1 -cos«)Q[y, a (0*) ; a (0*) , a (0*) ) 

where 0* is a suitably selected intermediate value between 0 and co. 

By a simple differentiation we find that 

fy’fiy, a) c; a, a (w) ] . 


The second property (43 -6) and the equation 


then show that 


a< («) = Ui cos { 61 — 0 ) + o,- sin (w — 0) 


(«) fv'i {y, a) = sin {u — 0)Q {y, a; a, a ) . 


The first formula of the lemma then follows from the third formula (43-5) and 
an integration of the last equation. The second formula of the lemma is then 
easily deducible by means of the mean-value theorem for a definite integral. 
CoROLLASY 51 - 1. The function W(y, p, q, w) defined by ike equations 


P> «(«)] 
1 — cos « 

“Qty. ?. ?1 


for 0 < (I) ^ w . 

for « = 0 


is continuous for aU sets (y, p, q, u) such that y is in R,, 

PiPi— I QiQi— 1 = piqi^ 0, 


This follows readily from the definition of W and the second formula of 
Lemma 51*1. 
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Lemma 51 • 2. Ij an admissible arc E satisfies the conditions II' and III', it 
also satisfies the condition 7/^. 

To prove this we note, first, that except when Y' = ± Ky' (* > 0) a particu- 
lar set (y, y', Y') will determine uniquely a set (y, p, q, «) by the equations 

(y/yp 1/2 « _ pyp ( y/ _ ^yp J 1/2 . 

(51-2) 

y'. 

d i (co) Pi cos 0) “f" ^ t sin 0) ^ jy/ Y^) * 

in which p is to be determined by the condition p,q^ = 0. When Y* = —icy' or 
F' = Ky\ the last equation is satisfied by arbitrary normed values q defining 
a direction orthogonal to p, and cj = r or o> = 0, respectively. We have, then, 
for every set (y, y', F') for which y is in the region Ry a set (y, p, q, w) for which 
the equation 

E{y, y\ FO = ( F.'F,') ^ico)] 

= (F;Fp^/Ml-cosco)»F(y, p, g, «) 

holds. The conditions II' and III' imply that the function TF(y, py g, a>) is posi- 
tive on the bounded closed set of points (y, />, g, w) determined by the condi- 
tions 

(y, p) isonJE, 1 = g,gt~ 1 =^tgt = 0, Ogwgir. 

From the continuity of W it follows that there is a neighborhood N of the 
elements (y, y') on E such that W remains positive at points (y, py g, w) for which 

(y, p) isiniV, - 1 = g.g, ~ 1 = />.g. = 0, 

The conclusion of the lemma follows readily. 

Theorem 51 • 1. A Second Sufficiency Theorem for a Strong Relative 
Minimum. If an admissible arc E 12 without corners and not intersecting itself satis- 
fies the conditions /, //', ///', /F', then I(Ei 2 ) is a strong relative minimumy as 
described in Theorem 50 • 2y provided that the region R of admissible elements (y, y') 
consists of all sets (y, y') with y[y\ 9 ^ 0 and y interior to a given region Ry of y- 
space. 

This theorem is an immediate consequence of Lemmas 44 -2 and 51*2 and 
Theorem 50*2. The former of the lemmas says that an arc £12 which satisfies 
condition III' is non-singular, and the latter states that conditions II' and III' 
imply IIJ^. Thus an arc £12 which satisfies the hypotheses of Theorem 51-1 
also satisfies those of Theorem 50 *2. 

Let us say that arc £12 Satisfies the condition IIV if the condition III' holds 
at every element (y, y') with y on £ 12 . The formula (51*1) shows that the prop- 
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erty III" implies the property II' along £ 12 . Hence we have the following 
corollary: 

Corollary 51*2. If an admissible arc E12 withoui corners and not intersecting 
itself satisfies the conditions 1, 777", i 7', then I(Ei 2 ) is a strong relative minimum, 
A corollary similar to this holds when the region R has a suitable convexity 
property, even if it is not of the form described in the first paragraph of this sec- 
tion. We shall say that R is convex relative to the variables y' when it contains 
all of the sets [y, a(0)] (0 ^ d ^ (o) defined by equations (51*2) whenever it 
contains (y, y') and (y, F'). An admissible arc £12 is said to satisfy the condi- 
tion IIIJ- if there is a neighborhood F of £12 in y-space such that the condition 
III' is satisfied at every admissible set (y, y') with y in £. 

Corollary 51*3. If the region R has the property of convexity just described^ 
then an admissible arc E 12 without corners^ not intersecting itself^ and satisfying 
the conditions 7, 777 f, 7F' will make I(Ei 2 ) a strong relative minimum. 

This is a consequence of Theorem 50*2, since the condition IHf implies that 
£i 2 is non-singular, by Lemma 44-2, and the formula (51*1) with Illjs* shows 
that the condition Iliv holds when N consists of all elements (y, y') with y in F, 
For parametric problems a table analogous to that of Section 17 for non- 
parametric problems is given here. The theorem implied in the third row of 

Table or Necessary and of Sufficient Conditions 


Type of Minimum Necessary Conditions SuflSicient Conditions 

Weak relative I, III, IV I, III', IV' 

Strong relative I, II, III, IV I, Hat, IV', E non-singular 

Strong relative I, Illjf, IV' 

Strong relative I, II', III', IV' 

Strong relative I, III", IV' 


the table holds when the region R of admissible elements (y, y) is convex 
relative to the variables y' in the manner described above. The last two 
rows of the table are applicable when 22 is of the form presupposed in the first 
paragraph of this section. 

52. Canonical variables and equations. On account of the homogeneity 
properties of the integrand function /(y, y'), it is not possible to define canonical 
variables (y, z) for the parametric problem by equations as simple as the equa- 
tions (26* 1) and (38*1) used in the non-parametric cases, the reason being that 
for the parametric case the determinant \f,\n\ identically zero, as follows 
from the second equation (43*2). We may, however, define canonical variables 
by means of similar equations whose form is suggested by the theory of the 
parametric problem when formulated as a problem of Lagrange,* equations 
whose effectiveness can be readily understood without reference to the La- 
grange theory. 

* See Teach, "The Hamilton-Jacobi The<»y for the Problem of Lagrange in Parametric 
Form" (dies., Univotdty of Chicago), Contributions 1933-1937, pp. 165-206. 
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Let us define a function G by the equation 

G(y, y', D =/ + /(v>-l) , 

where tp{y, y') is a function with the same homogeneity and continuity prop- 
erties as those of /. Then relations connecting the variables y, y', I with a set 
of canonical variables y, z may be written in the form 

(S21) = /y' ^—1 = 0. 

Near a particular solution (y, y', /, 2 ) at which their functional determinant 
with respect to the variables y', I is different from zero, these equations have 
solutions 

(52-2) yl^Piiy, Z), l^L(y, z) , 

as we know from the usual implicit function theorems. 

As a basis for the Hamilton- Jacobi theory, we shall assume that there is a 
region K of points (y, y', 1) defined by conditions of the form 

(y, yO ini?, 0< |/| <c 

and such that the equations (52 • 1) transform R! in a one-to-one way into a 
region S of points (y, 2 ). Furthermore, we assume that the value of the func- 
tion ip is different from zero everywhere in i?' and that the functional determi- 
nant of equations (52 • 1) with respect to the variables y', I is everywhere dif- 
ferent from zero. Then the equations (52 • 1) have unique solutions (52 • 2) cor- 
responding to each point (y, z) in 5, and by well-known properties of implicit 
functions the functions P,*, L defining them have continuous partial derivatives 
of at least the order w — 1 when the function / has continuous derivatives of 
the wth order (m ^ 1). An admissible set of values (y, 2 ) is, by definition, one 
belonging to the set S. 

A Hamiltonian function £f(y, z) may be defined, in analogy with the defini- 
tions in equations (26 *3) and (38 -3), by the equation 

(52-3) Hiy, z) = [y[Gy^^-GV~^> ^P^z,- f {y, P) =‘L(y, z) . 

The last value for H is found by multiplying the first equation (52 • 1) by Pi and 
summing with respect to i. With the help of equations (52 ■ 1) the derivatives 
of H are found to have the values 

(52-4) (y, P) , = P. . 

These have continuous derivatives of order m — 1 when / has continuous 
derivatives of order w, so that H also has continuous derivatives of order m. 
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The canonical equations of the parametric problem are, by definition, the 
equations 

(52-5) = ^ = P) = -H,,. 

Along every solution y(/), z{i) of these equations the value of J? = L is a con- 
stant, as one may readily verify by differentiation of £f[y(/), «(/)] with respect 
to /. By means of equations (52 • 1) with / = 0 every extremal yi{t) t St fe), 
with parameter t so chosen that ^ « 1 along it, defines a solution yt(/), Zi{t) of 
equations (52-5) on which = L = 0. Conversely, for every solution yt(0> 
Zi{t) of equations (52*5) on which = L = 0 the functions yi{t) define an 
extremal with parameter such that ^ s 1 . The proofs of these statements are 
simple consequences of the equations (52 *4), (52 *2), and (52 *1). 

For the definition of canonical accessory variables (/, 17 , f) the difficulty men- 
tioned above in the first paragraph of this section again arises and may be avoid- 
ed in a similar manner. We define a function Q by the equation 

Vf v'i X) =«(/> V, +X4», 
where 2 « is the quadratic form in the first equation (43 • 7) and 

V, v') =^<Pvi'ni + (Pv^iri'. 

is the first variation of <p. The arguments y(0, y'(0 (/i ^ ^ k) in the coeffi- 

cients of 09 are supposed to be those belonging to an extremal arc E whose cor- 
responding functions y(/), z(/) satisfy equations (52*5) and make L = 0. Ac- 
cessary canonical variables (/, 77 , f) may then be defined in terms of (/, 77 , 77 ', X) 
by means of the equations 

(52*6) f » “ ~ fyiVk’^k "I" “f* X^yJ, ^ = 0 , 

The properties of these canonical variables are summarized in this para- 
graph. They will be proved in the next paragraph below. The equations (52 • 6 ) 
have the solutions 

(52*7) Vi - + , X - Hy.Vi + Hg.^i , 

in which the arguments of the derivatives of H are the functions y(/), z{t) be- 
longing to the extremal arc E. We shall use the notations n<(/, 77, f) and A(/, 77, f) 
for the second members of these equations, respectively. The accessory Hamil- 
tonian function ft is defined by the first equation (52 *8), and its derivatives 
have the values in (52 *9): 

(52-8) ^(/, q, f) = 

(if V'f n, A), 

2 ^ Hy^j^ViVk “f* 2ffy^«j^77iffc "h B k 

*s — Qtf §114 * fUii f — B-i • 


(52*9) 
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In these equations the arguments t, ry, if', X of the derivatives of Q are /, if, n, A. 
The canonical accessory equations are defined to be the equations 


(5210) 


dt 


In order to justify the statements hitherto unproved in the last paragraph 
above, we first differentiate with respect to y* and Zk the equations (52 • 1) 
with their solutions (52*2) substituted. From the relations H — Ly yi == 
Hzi = P% and the fact that L = 0 along an extremal we thus find 

^vk^vi ” ® n^^yi “ > 

^ ~ *^yk ^y'j^^^jVk > ^ ~ zj^k * 


With the help of these equations and by direct substitution it follows that the 
functions (52 -7) are the solutions of equations (52 -6). Next the values (52 -9) 
for the derivatives of ^ can be calculated without difficulty from the second 
expression (52 *8). To deduce the third formula in (52 -8) we differentiate the 
first equation (52 *4) and find 

^viyk ~ ■" fyiyk ““ ““ ^vk'Pyi^ ^y%*k “ fyiVj^^j^k ““ ^*k^yi • 


Hence from equations (52 -9) and (52 *7) we see that 

= n. = Vk + f* ; 


and the desired formula follows at once, since f} is a homogeneous quadratic 
form and therefore 

The canonical accessory equations (52 * 10) have a number of important spe- 
cial properties. There is one and but one solution ift(/), ft(/) of these equations 
which at a given value t takes given initial values if^o, f io. In particular, the 
unique solution determined by the values ifio = f<o = 0 is identically zero. 
Along each solution if(/), f(/) the value of X = A[/, if(/), f(^)] is a constant, 
as one can see by differentiation of the second formula (52 *7) with respect to 
/ and the use of (52 10), (52-5), (52-8). By differentiation of Hiy, z) and the 
equations (52 -5) with respect to t we see that the functions y', 2 ' belonging to 
a solution of equations (52 • 5) are a solution of the accessory canonical equa- 
tions (52 • 10) along which X = 0. The accessory equations for the parametric 
problem are, by definition, the equations 

^ n 

^ = 0, 

and an accessory extremal is a solution n(l) of these equations. Every accessory 
extremal >?(/) defines by means of equations (52-6) a solution ij(0» f(0 of the 
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accessory canonical equations (52 • 10) along which X == 0, and conversely. For 
every family of solutions y<(/, a), Zi(/, a) of the canonical equations (52*5), de- 
pending on a parameter a, the derivatives y,*, Zto are solutions of the canonical 
accessory equations (52 • 10) ; and if ff s 0 on the original family, then X s 0 
on the corresponding accessory extremals. Since the equations (52 *10) are 
linear and homogeneous in the variables 97, and their derivatives, it follows 
from a well-known theorem that a determinant whose columns are 2n solutions 
of these equations is either identically zero or everywhere different from zero. 

Lemma 52* 1. If riw, fi<r (cr = 1, . . . , 2n — 2) are 2n — 2 solutions of the 
canonical accessory equations with X = 0, and if ri, f is a solution with X = 1, 
then the determinant 


(52*11) 


V( 77 i 


is either identically zero or everywhere different from zero and is equal to the determi- 
nant d(t) of Lemma 48 1 multiplied by a non-vanishing factor. 

This follows at once from the fact that the product 


0 0 

/yivy yi 

0 y;. 0 


^3^ y\ 0 7 jj 

y,- yi ^i 

0001 


is equal to the determinant (52 *11) except for a non- vanishing factor. 

A solution rjj f with the properties described in the lemma may be determined 
by initial values at a particular value to satisfying the equations 97, = 0, i = 

1. The 2» — 2 solutions rua, ft<r may be determined by any set of initial values 
which satisfies X = Hy/^k + fi^**?* = 0 at / = /o and makes the initial value of 
the determinant (52*11) different from zero. These are accessory extremals 
which determine conjugate points such as described in Lemma 48 * 2 and Theorem 
48* 1. A set of n — 1 solutions 97, > effective for the determination of conjugate 
points as in Theorem 48 *2 may be determined by a matrix of initial values 
Vir 0, fir with the columns of the matrix ||fir|| linearly independent and mak- 
ing X = 0 at /= /o. 

53. The imbedding theorem and the Hamilton- Jacobi theory. The imbedding 
theorem for a non-singular extremal arc E can be established anew by the use 
of canonical variables and equations. As in Section 27 for the non-parametric 
case, the same continuity properties for the family of imbedding extremals can 
be obtained by this method with the weaker hypothesis that the integrand func- 
tion /(y, yO has continuous derivatives of only the third order, instead of the 
fourth, in a neighborhood R of the values (y, y') belonging to E. We make the 
further restrictive hypotheses that the arc E does not intersect itself in y-space 
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and that the function y') is positive, so that the parameter / can be so 
chosen that ip ^ 1 alon§ E. 

The equations (52 • 1) defining canonical variables have as solutions the set 
of values [y(/), z(t), y'(/), / = 0] belonging to the extremal arc E which we are 
seeking to imbed. No two of these solutions have the same projection (y, «), 
since the arc E does not intersect itself. Furthermore, the functional determi- 
nant of the equations (52 • 1) with respect to the variables y', I is different from 
zero along the arc JE, since E is non-singular, as one may show by the methods 
used in proving Theorem 43 • 1. It now follows, with the help of well-known im- 
plicit function theorems, that there is a neighborhood R of the sets [y(t), y'(0> 
/ = 0] belonging to E which is transformed in one-to-one fashion into a neigh- 
borhood S of the points (y, z) belonging to £, and the functions (52 • 2) so de- 
fined in 5 have continuous partial derivatives of at least the second order, since 
the same is true of the second members of equations (52 ■ 1) when /(y, y') and 
ip{y, y') have continuous third derivatives. 

The second members of the canonical differential equations (52 • 5) are thus 
well defined and have continuous second derivatives in the neighborhood S of 
the arc E. By the usual existence theorems fc r differential equations the unique 
solution of equations (52 • 5) through an initial point (/°, y®, z®) is defined by 
functions of the form 

(53*1) y,(/-^®, y\ 2 ®) , y®, z®) , 

which with their derivatives y*!, z,i have continuous second partial derivatives 
in a neighborhood of the values (/, /®, y®, z®) belonging to the arc E. The argu- 
ment /® occurs only in the difference / /®, since the second members of equa- 

tions (52 • 5) do not contain / explicitly. Let /®, y®®, z®® be a fixed set of values 
belonging to the arc E. Since from the homogeneity property of tp 

ip{y, P) ^Pxipy'iiy, />) = 1 , 

the values P* = cannot vanish simultaneously; and we may assume, with- 
out loss of generality, that the notation is so chosen that (y®®, z®®) 9 ^ 0. 
Let ftvr (<r = 1, . . . , 2« — 2) be solutions of the canonical accessory equa- 
tions (52 • 10) along E with the properties described in Lemma 52 • 1. We define 
functions yj(a), zj(a) by the equations 

y0(a) =y®® + T?,^(/®)a<r, 

20 (a) =zj® + f..(/®)a. 

(f= 1 n ; f = 1, . . . , » — 1 ; er = 1, . . . , 2» — 2) , 

and a final function z®(a) by the equation 

H[yHa). z®(a)] = 0 , 



138 


LECTURES ON THE CALCULUS OF VARIATIONS 


which has the initial solution a = 0, Sn = C at which 7 ^ 0. The function 
2 ®(a) so determined has continuous derivatives of the third order, since H{yy z) 
has such derivatives. Since for each column rua, ftv of the determinant (52*11) 

it follows without difficulty that 2 i(a) has the derivative at a = 0. 

The {2n — 2)-parameter family of solutions of the canonical equations 

yi = Yi (/, a) = yi [t - y® (a) , z® (a) ] , 

Zi=Zi(/, a) =z<[/~/®, y®(a), z®(a)] , 

obtained from (53 1) by substituting the functions y®(fl), z®(a), is a family of 
extremals with the properties described in the imbedding theorem, Theorem 
45 1. It contains the arc E for the values h ^ t ^ = 0. Its variations 

relative to the parameter have the initial values and satisfy 

the canonical accessory equations, and hence are identical with t? »>(/), 

The determinant of the family analogous to (45 -3) is different from zero be- 
cause the determinant (52*11) has that property. Thus the conclusions of the 
imbedding theorem are proved. 

The imbedding theorem has to do with extremals and regions R' and S which 
are near to a given extremal arc E. Let us return to the somewhat more general 
regions described at the beginning of Section 52, The slope functions pJiy) of a 
field can be modified by a positive factor K{y) so that they satisfy the condi- 
tions ip[y, piy)] = 1. We have then the following theorem, which is funda- 
mental for the Hamilton- Jacobi theory: 

Theorem 53 *1. For every field F the invariant integral taken from a fixed 
point y® to a variable point y defines a function W(y) having continuous second 
derivatives in F and satisfying the partial differential equation 

(53*2) //(y, Wy) =0. 

Conversely, if in a region F a function W(y) with continuous second derivatives 
defines points (y, z) = (y, Wy) in S for every y in F, and satisfies the equation 
(53-2), then F is a field with the slope functions 

(S3-3) Pi{y) =P,[y, Wy] 

which satisfy the cdndition ^[y, p(y)] = 1. 

To prove this, one should note, first of all, that the condition fl(y, z) = 0 
is a necessary and sufficient condition for the equations 

= fvi (3^1 y') I ^ y') = 1 

to have a solution y' when y and z are given. This is a consequence of the 
equivalence of the systems (52*1) and (52*2). Furthermore, the condition is 
necessary and sufficient for the equations 

2i=*/vi(y» y') 
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by themselves to have a solution y' when y and z are given, since a set y, y' 
which satisfies these equations can always be made into a set which satisfies 
^ = 1 by multiplying t&e variables y' by a suitable factor k > 0. All of the 
equations £r(y, z) = 0 obtained by using different functions ^(y, y') in the 
equations (52 • 1) are equivalent because of the property which has just been 
proved. 

The first part of the theorem is now an easy consequence of the definition 
(53-4) W(y) = T/yily, P(y)]dy, 

•/yO 

of lV{y)f since it follows from this equation that 
(53-5) = Piy)] 

and hence that the equation (53*2) is satisfied. Conversely, suppose that W{y) 
has the properties described in the last part of the theorem. Then the functions 
pi{y) defined by equations (53 *3) satisfy the equations (53 *5). Consequently, 
the integral (53 *4) is independent of the path in F, and F is a field. 

The equation (53 • 2) is called the partial differential equation of Hamilton 
and Jacobi, The surfaces W(y) = constant, where W is the function defined for 
a field F as in Theorem 53 1, are the transversal surfaces of the field. These 
surfaces have properties like those described in Theorem 28 • 3 and provable in 
the same way. 

There is a theorem for the parametric problem which describes the determi- 
nation of extremals by means of a so-called complete integral W{yy 6i, . . . , bn-i) 
+ c of the Hamilton-Jacobi equation. In this theorem, whose results are analo- 
gous to those of Theorem 29 • 1 and Section 38 for non-parametric problems in 
(» -f- 1) -space, the function W involved is supposed to have continuous partial 
derivatives of at least the third order and to have its matrix of rank 

» — 1 at all sets (y, b) such that y is in a region F of y-space and 5 is in an 
(n — l)-dimensional region B. Furthermore, the sets (y, z) = (y, Wy) corre- 
sponding to such points (y, b) are all supposed to be admissible. 

Theorem 53 • 2. If for points y and b in regions F and B the^ complete integral 
W(y, h) + c of the Hamilton-Jacobi equation {5 3 '2) has the properties described 
above, then every admissible arc En interior to F, having no corners and satisfying 
the equations 

(53-6) Wb^{y,b)=ar (r = 1, . . . , w - 1) 

for particular values aro, bro, is either already an extremal or else becomes one when 
its parameter t is changed in sign. It follows from this, also, that the equations 
(53-6) have as solutions a (2n — 2)-parameter family of extremals yi(t, a, b) 
which contains Ew and has the properties described in the imbedding theorem, 
Theorem 45 • 1. 
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To prove the theorem we note first that the equations 
(S3-7) , Hy, + H,,Wy,y,^0 

are consequences of the equation (53 *2). Furthermore, along the arc £12 the 
functions yi, Zi = Wyi satisfy the relations 

(53*8) W,^y,y,^0, z[^Wy^,yl. 

These, with equations (53 *7), show that the derivatives yl, 2 ,' along £12 are 
proportional to If the paramater t is properly transformed, and 

changed in sign if necessary, the proportionality becomes equality. The arc £12 
then satisfies the canonical equations (52*5) and has B = 0 along it from 
(53 *2), and hence is an extremal. 

To establish the second part of the theorem we first prove that the equations 
(53*6) are satisfied by a family of extremals y<(/, a, b) with the continuity prop- 
erties described in the theorem. These equations have as an initial solution the 
set of values yio, OrO) bro belonging to an arbitrarily selected parameter value /q 
on £ 12 , and at least one determinant of the matrix ||PF 5 ,„<|| is different from 
zero at this solution. Hence by the usual implicit function theorems the equa- 
tions (53 * 6) have a set of solutions y< = iy<(a, b) with continuous second deriva- 
tives near (oo, bo) and such that bo) = yio* The existence theorems for 
differential equations then tell us that for the equations 

(S3*9) Wy(y, b)] 

there exist unique solutions y<(/, a, b) determined by the initial values Tji(a, b) 
at / « /o and having the desired continuity properties. On account of the first 
equation (53 * 7) and (53 * 9) the derivative Wbr is constant along these solutions, 
and the value of the constant is ar, since the equation (53 *6) is satisfied at 
to. 

With the help of Lemma 52* 1 we can further prove that the determinant 
(45*3) is different from zero along the arc £ 12 . For, if we define functions 
Zi{tf a, b) in terms of y»(/, a, b) by the equations Zi = Wyi, then the resulting 
curves are extremals and the sets yia„ Ziar and yibn Zihr form 2» — 2 solutions 
of the accessory canonical equations (52*10) on which X » 0, according to 
the remarks in the paragraph preceding Lemma 52 *1. For these solutions we 
form the determinant 

y'i 

described in Lemma 52 -1. When we substitute the values 


iia, - ® , 
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it follows, after a simple reduction, that the value of the determinant is a power 
of — 1 times the product 




f* — Wyj^y.rii 

y^. yi 


But this product is equal to unity, as one sees with the help of the easy conse- 
quence Whrvhyka, = 5r« of equation (53*6), the second equation (53*8), the 
canonical equations (52*5), and the fact that Hyir)i+ = 1 in Lemma 
52*1. 

54. The construction of a complete integral of the Hamilton- Jacobi equation. 
In the preceding paragraphs it has been shown that, when a complete integral 
W{y, 6) + c of the Hamilton-Jacobi equation (53 *2) is known, a (2» — 2)- 
parameter family (45 * 2) of extremals can be determined by solving the equa- 
tions Whr = Conversely it is true that, when the extremal family of the 
imbedding theorem is known, a complete integral can be constructed. These 
results are described in the following theorem: 

Theorem 54 *1. Every non-singular €xtre$nal arc E 12 can he imbedded in a 
(2n — lyparameter family of extremals yi(t, a, b) with the properties described 
in the imbedding theorem^ Theorem 45 1, and with the further property that for 
every fixed set of values br (r == 1, . . . , n — \) the family is a Mayer family with 
the variable parameters a,. Every region F of y-space which is simply covered with 
determinant | yia, y i 1 different from zero by such a Mayer family is a field with 
the slope functions ^i(y^ b) of the family. The value W(y, b) of the invariant inte- 
gral I* taken from a fixed point yo to a variable point y in the field defines a com- 
plete integral W(y, h) + c of the Hamilton-Jacobi equation with the properties 
described in Theorem 53' 2, Every complete integral is definable in this way. 

To prove this theorem let us consider the equations (49 • 6) with the function 
U replaced hy U + arffr- The resulting equations have solutions ar(a, ff), 
br(a, 0), r(a, 0) having contmuous second derivatives near the values (a, 0) = 
(tto, 0) and such that ar(ao, 0) = Oro, ftr(ao, 0) = bro, T{ao, 0) = /o, according 
to the argument in the last paragraph of Section 49. When the functions 
aria, 0), bria, 0) are substituted, the family (45 *2) has functions of the form 
(54*1) yiit, a, P) . 

They define the arc E for values h^t^ / 2 , Or = Uro, jSr = 0 and have con- 
tinuity properties like those of the functions (45 *2). 

The determinant analogous to (45*3) for the functions (54* 1) is the product 
of (45*3) by the determinant 

j ^ 

bra^ brP, 0 
0 1 

0 0 


(54*2) 


0 

0 


0 

0 

1 
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and is different from zero if this last determinant does not vanish. If the de- 
terminant (54 *2) were zero at the initial values (a, $) = (ao, 0), we should 
have constants c., not all zero, such that 

H” = 0 , "f* brfi^dt =* 0 • 

But from equations (49 -6), with Uar + Pr hi place of Uan 

yiarO'ra^ + yih^ bra^ + y^Ta, = Y , 

yiar(^r0^ + yibr^rfi, + y'J'fi, * ^ • 


Y iykaf^r0^ “f“ ykh^btfi^ “f“ yk • 

Multiplying the first of these equations by c,, the second by and adding, 
we see that we should then have Cg = T 0 sdg = 0 because the (» — l)-space 2) 
and the extremal E are not tangent and the determinant | Yi^r y< | is therefore 
not zero. Multiplying the last equations by dg and adding shows that we should 
also have dr = 0, which with = 0 is a contradiction. 

Thus the family (54* 1) contains the arc E and has all the properties of the 
family (45 * 2) of the imbedding theorem with parameters a, in place of o, b. 
Furthermore, we can see that for every fixed set of sufficiently small values p 
this family is a Mayer family with the variable parameters a, because of the 
arguments in Section 49. In every region F of y-space simply covered by the 
family with determinant ly^ar yi\ different from zero, the equations 

(54-3) y<(/, a, p) ^yi 

have solutions /(y, j8), ar(y, P) with continuous second derivatives for y in P 
and for P sufficiently small. The slope functions 

(54-4) piiy, p) =yr[/(y, p) , a(y, P) , p] 

have their determinant \Pi 0 r pi\ different from zero. Otherwise there would be 
multipliers Cr, not all zero, such that 

<^rp%fir'^ ^Pi’^ 0 . 

But from equations (54*3) and (54 -4) we have 

0 = y.-.,awi. + yi3. + y'A , 

pi»,^ yia^o-rfi, + y\ + y < to, 

from which it would follow that the constants 

, Cg , c,to, , c 
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would satisfy the linear equations whose coefficients are the rows of the determi- 
nant analogous to (45 *3) for the family (54 -3). This would require c* = c = 0, 
which is a contradiction. 

The value of the integral /* taken from a fixed point yo to a variable point y 
in the field F is now a function IF(y, /3) with the properties described in Theo- 
rem 53 *2. It has the derivatives 

= A; [^1 Phy /3) ] , ^vfir “ fv\nPkfi, • 

The matrix of derivatives is of rank » — 1. Otherwise there would be 

multipliers Cr, not all zero, satisfying the equations 

= 0 ; 

and the linear combinations CrpkBr would necessarily be proportional to the 
values y[ = pk, which is impossible since the determinant | pifi^ pt | is different 
from zero. 

The last statement of the Theorem 54 1 is easily proved. For every fixed 
set of values b a complete integral Wiy^ b) + defines a field in the region F with 
slope functions piy, b) = Wyiy, b), according to Theorem 53 - 1. The extremals 
of these fields form a (2n — 2)-parameter family defined by functions y,(/, o, b) 
with the properties described in Theorem 54 *1. Thus the theorem is completely 
proved. 

S5. Other theories of parametric problems. There are a number of methods 
for the discussion of parametric problems of the calculus of variations other 
than that presented in the preceding sections. In the present section a brief 
description of some of these will be given, with comments on their relative 
convenience. 

Since the choice of a parameter on a parametric arc can be made quite arbi- 
trarily, it turns out to be reasonable to formulate the parametric problem as 
that of finding, in a class of arcs y,(/) (0 g / g h) joining two given points 
i and 2 in y-space and satisfying an equation of the form 

(55-1) ^(y, y') = 1 , 

one which minimizes the parametric integral / in equation (42 1). The func- 
tion ^ is supposed to be positive in the region F of points (y, y') under considera- 
tion and, like the integrand function /, is assumed to be positively homogeneous 
with order one in the variables y'. 

The equation (55 • 1) is really only equivalent to a specification of the param- 
eter to be used on a particular arc y(r) (ri ^ t g T 2 ). For, if on this arc 
the function ^[y(T), y'(r)] is represented by ^(t), then in terms of the new 
parameter 

/ = y* }P(t) dr 
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the arc will be defined on an interval 0 ^ ^ ^ /s and will satisfy the equa- 
tion (55*1). 

Formulated in the manner described in the next to the last paragraph above, 
the parametric problem becomes, in the space of points (/, y), a non-parametric 
problem of the type usually designated by the name of Lagrange. It is a special 
case of the problem of Bolza, which will be discussed in a later chapter of this 
book. The theory of such problems is not so simple as that of problems having 
no differential equations, such as (55 *1), as side conditions. It is complicated 
further in the problem formulated above because in (/, y)-space the second 
end-value is not fixed. At the initial point the values / = 0, y,(0) = yn are 
all prescribed; but at the second end-point we require only = y,- 2 , leavmg 
the coordinate fe variable. 

In spite of these complexities, the theory of the problem as a problem of 
Lagrange has been very suggestive for the developments of the preceding pages. 
If we use the notation G = / + /^, the equations for the extremals and canonical 
variables of the Lagrange problem are, respectively. 



2i==Gy;= /y;+ = 1 • 

For the special case when (p = (yl these suggested the treatment of the 
equations of the extremals in Section 45 and the equations (52 * 1) defining ca- 
nonical variables. 

For the case when the integrand function / is positive in the region R of 
points (y, y') in which the continuity properties of / are prescribed, Wren has 
devised a quite different method. The function g(y, y') defined by the equation 
2g » /* is positively homogeneous of order two in the variables y'; and from a 
well-known property of homogeneous functions 

2g = gii(y.y') y'iy'i , 

where ga is a notation for the derivative d*g/ dyl dyj. The integral (42 • 1) is 
then expressible in the form 

/ = /*' (2g) = /‘* igiiy\y',) . 

The length integrals in the theory of Riemannian geometries and the integrals 
which play a fundamental role in the theory of relativity are of the form last 
given with coefficients ga which do not contain the variables y^ Analogous the- 
ories have been studied for the more general integrals I for which the coefficients 
gii contain the variables y'. A space in which the length of an arc is defined to 
be the value of such an integral is called a ^Tinsler space.” In all of these the- 
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ories tensor anal)rsis plays an important role. The formulation of Wren seems 
especially well adapted Jto the use of this mechanism. It is interesting to note 
that the defining equations for canonical variables and the form of the Hamil- 
ton- Jacobi theory, presented by Wren, were suggested by the formulation of the 
problem as a Lagrange problem with the function tp in equation (55 • 1) equal to 
the integrand function / itself. 

In the discussion of the accessory equations (47 • 2) and their solutions fre- 
quent use was made of the so-called ‘‘normal solutions” i;,, which satisfy the 
equation y[r\i = 0. In the theory of the parametric problem as a Lagrange 
problem it turns out that a role equivalent to that of this last equation is played 
by the equation 

(55*2) + 0 . 

The use of either one of them is compatible with the fact that the accessory 
equations are not independent, as indicated by the third equation (43*7). 
Hestenes has used the equation y[ rii = constant, which is closely related to the 
equation (55 *2) for the special case when tp = (yi y[y^^. He has also called at- 
tention to the fact that a large class of equations other than (55 • 2) can be used 
in the same way. 

There is one final method of reducing a parametric problem to a non-para- 
metric problem in a neighborhood of an arc E not intersecting itself whose mini- 
mizing properties are to be studied. If the functions defining E are yt(/)(/i g 
^ ^ / 2 ; i = 1, . . . , »)i let us select w — 1 vectors f,r(0(^ = 1, 1) 

such that for each value of t on the interval t ^ h the determinant 

(55-3) \y\{l) 

is different from zero. The equations 

(55-4) = +Wrtir(/) 

then have as initial solutions the values (y, /, u) = [y(/), /, 0] belonging to the 
arc Ey and at these solutions the functional determinant of the second members 
is the determinant (55 -3), which is different from zero. From the usual implicit 
function theorems it follows that the equations (55 *4) define a one-to-one corre- 
spondence between the points y of a neighborhood F of the arc E and the points 
(/, w) of a neighborhood T of the values (/, u) belonging to E. The solutions /(y), 
Uriy) of the equations (55 *4) defined over F will have continuous derivatives of 
as many orders as are possessed by the functions yi(/), provided that the vectors 
iirif) are chosen to have continuous derivatives of the same orders. 

After the transformation (55 *4) the equations of the arc E take the simple 
non-parametric form Wr = 0 (r = 1, . . . , n — 1) in (/, M)-space, and the inte- 
grand /(y, y') becomes a function g{ty «, «')• All of the necessary and the suffi- 
cient conditions for an arc to minimize the transformed integral / in non-para- 



146 . LECTURES ON THE CALCULUS OF VARIATIONS 

metric form are applicable to E in {/, w)-space. For a complete theory of the 
original parametric problem it is necessary to extend the condition of Weier- 
strass and the sufficiency proofs to include all parametrically admissible direc- 
tions and curves and, further, to interpret the results for the non-parametric 
problem in terms of the original coordinates y and integrand function /(y, y')- 
This method of transformation of coordinates is an interesting and some- 
times very useful method, which has, however, features which make it not al- 
ways desirable. In the first place, the determination of a transformation (55*4) 
with the desired properties is difficult and sometimes quite impractical if the 
arc £ has corners. To preserve for the transformed integrand g(t, m, v!) the con- 
tinuity properties originally assumed for the integrand function /(y, y'), it is 
necessary that the functions yi(0 defining the arc E should have continuous 
derivatives of at least the fifth order. The interpretation for the parametric 
problem in y-space of the results for the non-parametric problem in (/, «)-space 
and the extension of the condition of Weierstrass and the sufficiency theorems 
mentioned above involve labor comparable with that required in making the 
proofs directly, as in the preceding sections of this chapter. 



CHAPTER VI 

PROBLEMS WITH VARIABLE END-POINTS 


56. Introduction. The problem of the calculus of variations considered in 
the preceding chapters is concerned with the determination of a minimizing arc 
for an integral 

(56-1) ^ = X Vu. y. Z, y, z') dx 

of the form (1 *6) in a class of arcs joining two fixed points, 1 and in x^^z-space. 
Such a problem is said to have fixed end-points. A natural modification would be 
to seek a minimizing arc in a class of arcs joining a fixed surface to a fixed point, 
or a fixed surface to a fixed curve. Other similar problems suggest themselves 
at once. They are called “problems with variable end-points,” for reasons which 
are evident. 

The brothers James and John Bernoulli were apparently the first to consider 
a problem with variable end-points. About the year 1697 they sought to deter- 
mine in a vertical plane an arc down which a particle will fall from a fixed point 
i to a fixed curve L in the shortest time, when started at 1 with a given initial 
velocity. Euler, in his famous book of 1744 on the calculus of variations, studied 
much more general problems with variable end-points but did not formulate his 
end-conditions clearly. About ten years later Lagrange and Borda found that the 
problem of the Bemoullis described above, with first end-point variable instead 
of the second, or both end-points variable on fixed curves, had quite new char- 
acteristics. As a result of his study of these special problems, Lagrange devised 
others with variable end-points of great generality, including those before de- 
scribed by Euler, and found first necessary conditions for a minimum. Not much 
further progress was made until about the year 1898, when Kneser defined what 
is called a “focal point” and proved a sufficiency theorem for a general problem 
in the plane with one end-point variable. His focal point is the analogue, for a 
problem with one variable end-point, of the conjugate point for a fixed end-point 
problem. In 1902 and 1904 the author of these pages discussed the dependence 
of the focal point upon the curvature of the initial end-curve and treated in de- 
tail problems in the plane for which both end-points are allowed to vary on 
curves. Within the past few years a similar discussion of a very general problem, 
formulated by Bolza in 1913, has been given by Graves, Hestenes, Bliss, and 
others. Among the most general problems of the calculus of variations so far 
formulated, the problem of Bolza is in many ways the most convenient in form. 
Its theory is presented in Part H of this book. 
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The purpose of the present chapter is to present, in the most general form yet 
devised, the theory of the problems of the calculus of variations with variable 
end-points associated with the problems with fixed end-points which have been 
studied in preceding chapters. In Sections 57 and 58 the problems in o^z-space 
with second end-point fixed and first end-point variable on a surface or a curve 
are studied in a preliminary way by relatively geometrical methods closely re- 
lated to those of the preceding chapters. In Sections 59-64 a very general prob- 
lem with variable end-points in (x, yi, . . . , yn)-space is formulated, and the the- 
ory of necessary and of sufficient conditions for a minimum is developed. The 
fourth necessary condition which appears in the theory, developed in Section 
64, is a modification by Goldstine of an earlier one given for the problem of Bolza 
by Hestenes. Sufficiency theorems are proved in Sections 60 and 63 with the 
help of an important theorem due to Hahn and with the aid, further, of the the- 
ory of the so-called “extremal integral.” When only one end-point is variable, 
the condition which is usually called the “fourth necessary condition” can be 
stated in terms of the focal point of the manifold on which the initial end-point 
is allowed to vary. The theory of these focal points and the dependence of the 
focal point of a curve on the first curvature of the curve are studied in Sections 
65 and 66. Finally, in Section 67, some interesting theorems concerning problems 
in the (x, y)-plane with variable end-points are deduced. 

57. Problems in three-space with one end-point variable on a surface. The 
problem to be studied in this section is that of finding in the class of admissible 
arcs joining a fixed surface 5 to a fixed point 2 one which minimizes the integral 
I in equation (56*1). The properties which characterize a minimizing arc can 
readily be deduced by methods similar to those applied in the preceding chap- 
ters for problems in three-space with fixed end-points. 

Let £i 2 represent, as before, a particular admissible arc whose minimizing 
properties are to be studied. It will be supposed that the fixed surface S is defined 
by functions 

(57*1) {(a, /S), 17 (a, /3), f(o, /3), 

which have continuous derivatives of the third order near the values (ai, ffi) 
which define the intersection pomt I of 5 and £ 12 . 

Since every admissible arc joining the end-points 1 and 2 of £12 also joins the 
surface S with 2, it is evident that £12 must satisfy all the necessary conditions 
for a minimizing arc in the class of arcs joining its end-points — in particular, 
the necessaiy condition I of Section 6 and the necessary conditions II and III 
of Weierstrass and Legendre of Section 9. It will be seen presently that a further 
necessaiy condition, called the “transversality condition,” must be satisfied at 
the intersection point i of 5 and £12 and that the condition of Jacobi for the 
fixed end-point case can be replaced by a stronger condition of a similar char- 
acter. 
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In order to deduce the so-called “tranversality condition,” let T be an arbi- 
trary curve on S through the point 7, defined by functions a(a), P(a) which 
have continuous second derivatives near the parameter value ai defining the 
point i. The curve T can be joined to the point 2 by a one-parameter family of 
admissible arcs containing £12 as a member, and Theorem 8 - 1 of Chapter I is 
applicable to the family. For example, when the arc £12 is defined by functions 
y{x), z{x) (xi^ X ^ X 2 ), such a family is defined by functions 
y (*) + { [o (a) , /S (a) 1 - y ({) } ^ (x, o) , 

2 (*) + { f [a (a) . |8 (o) ] - z ({) } ») (x, a) , 

where 

X X2 

“{[a (a), 

The arcs of this family intersect the curve F for a; = { [o(a), 13(a)] and pass 
through the point 2, and the family further contains the arc £12 for the parameter 
value a = ai. According to Theorem 8 1, the value of the integral 7 taken along 
a curve of the family is a function of the parameter o, whose differential at the 
value fli defining £12 in the family is 

dJ = — / djc — (dy — y'dx) fy* — (dz — z^dx) | ^ , 

where the differentials dx^ dy^ dz belong to F and the element (x^ y, 2 , y', s') in / 
and in its derivatives belongs to £ 12 . If 7 (£ 12 ) is to be a minimum, this differential 
must vanish; and since F is an arbitrary curve on 5 through the point 7, the 
following theorem is established: 

Theorem 57 1. The Transversality Condition. An admissible arc E 12 
is said to satisfy the transversality condition at its intersection point 1 with a sur- 
face S if the equation 

(/“■//v' - 2 '//) dx+ fy>dy + f,>dz = Q 

is satisfied by every direction dx:dy:dz tangent to S at the point 1, the arguments 
X, y, z, y', z' in f and its derivatives being those of the arc E 12 at 1. Under these cir- 
cumstances the surface S is said to cut the arc £12 transver sally at the point 1, ac- 
cording to the definition of Section 28. Every minimizing arc for the problem of this 
section must satisfy the transversality condition. 

The necessary condition analogous to that of Jacobi, which is now to be de- 
duced, applies only in case the minimizing arc £12 is a non-singular extremal arc 
cut transversally by the surface S at the point 7 and not tangent to 5 at the 
point 7. Besides these properties, we shall also assume that the surface S is non- 
singular at the point 7. Then there is a two-parameter family of extremals 
(57-2) y(x, a, jS) , s (x, a, fi) 

containing £12 for parameter values (ai, ft) and having the following properties. 
The members of the family are cut by S transversally at « = {(a, ft. The func- 
tions (57 • 2) and their first and second derivatives with respect to x have contin- 
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uous second partial derivatives for values (x, a, 0) in a neighborhood of those 
belonging to £12. Finally, the determinant 

Mx, a, 0) ^yaZ0-yfiZa 

is not identically zero along £12. The existence of this family is a consequence of 
the arguments already applied to equations (20 -7) for the particular case when 
the function W in those equations is identically zero. The determinant A of the 
family is different from zero at the point 1 on £12, since the identities 
17 (a, 0) -yiii a, 0) , f(a, 0) = z (f, a, 0) 
shows that at « = {(a, 0) 

ya“ yia y^ (a > Za = fa Z^ fa , 

= 7?^ — y'f/j , z/9= f/5— z'{^ . 

As in the argument following equations (20 -7), the determinant of these quanti- 
ties is different from zero at the point 1 when S has an ordinary point and £12 
is not tangent to 5 at i, as has been presupposed. 

Definition of Focal Points. Let £12 be an extremal arc cut transversally 
by a non-singular surface 5 at the point 1 and not tangent to S at 1. If A(x, a, 0) 
is the determinant of the two-parameter family of extremals (S7 • 2) cut trans- 
versally by the surface S and containing £12 for the parameter values ai, ft, then 
the points determined on £12 by the zeros of A(jii:, ai, /3i) are called the focal 
points of S on £12. As we have seen in Section 13, they will, in general, be points 
of contact of £12 with the enveloping surface of the family (57 • 2). 

It is not difficult to see that the focal points of the surface 5, so defined, are 
independent of the parametric representation chosen for S, For, if in the func- 
tions (57 * 1) defining S the parameters a, 0 are replaced by functions a (7, 5), 
0(y, 8) of two new parameters, the family (57 • 2) with the original parameters 
a, 0 will go into a second family with similar properties. The determinants 
A(x, a, 0) and A(x, 7, 6) of the two families will differ only by a non-vanishing 
factor independent of x. 

The argument used in the proof of Theorem 13 * 2 shows that, except in special 
circumstances, there will be associated with a focal point on £12 a one-parameter 
family of extremals belonging to the family (57-2), containing £12, and having 
an envelope which touches £12 at the focal point. The envelope theorem. The- 
orem 10-2, then holds for this family; and the method used in Section 10 for the 
proof of Jacobi’s condition justifies the following theorem: 

Teieorem 57 *2. The Focal-Point Condition. A non-singular extremal arc 
Ei 2 cut transversally by a non-singular surface S at the point 1, and not tangent to S 
at 1, is said to satisfy the focal-point condilion if there is no focal point of S on E12 
between 1 and 2. Every non-singular minimizing arc for the problem of this section 
must be an extremal and satisfy the focal-point condition. 

The proof of this theorem indicated above has the same limitations as the 
corresponding geometric proof of the Jacobi condition m the fixed end-point 
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case. However, as in the case of fixed end-points, the theorem can be proved with 
greater generality by the use of the second variation. This will be done for a 
more general problem in Section 60 below. 

The sufficiency theorems for the problem of this section are readily deducible 
from the reasoning in Sections 16 and 19 of Chapter II. The Roman numeral I 
will be here used to designate the necessary condition of Section 6 plus the 
transversality condition of Theorem 57 *1. This grouping is justified by the fact 
that the two analogous conditions for the more general problem of Bolza, which 
will be discussed in Part II of this book, are provable together, and together con- 
stitute the famous multiplier rule for that problem. The notations IIa^ and III' 
will denote, as in Section 16, the strengthened conditions of Wcicrstrass and 
Legendre; and the symbol IV' will be used to denote the focal-point condition 
strengthened so as to exclude the point 2 on £12 from being a focal point of 
5. Sufficient conditions for a minimum are then as follows: 

Theorem 57*3. Lei E12 he an admissible arc without corners cut at a single 
point \ by a surface S which is non-singular and not tangent to the arc E12 at If 
Ei2 satisfies the conditions 7, 7/7', 7 V' defined above in this section^ there is a neigh- 
borhood Ri of the values (x, y, z, y', z') belonging to E12 su>ch that the inequality 
I(C32) > I(Ei2) holds for every admissible arc C32 in 'S.i joining S with 2 and not 
identical with E 12 . 7/Ei2 w non-singular and satisfies the conditions 7, 77n, 7F', 
then there is a neighborhood F of the values (x, y, z) on E12 such that the inequality 
holds for every admissible arc C32 in F joining S with 2 aitd not identical with E12. 

To prove these results, one should first notice that the conditions I, III' im- 
ply, as in the fixed end-point case, that Evi is a non-singular extremal arc and 
hence belongs to a two-parameter family (57-2) of extremals cut transversally 
by the surface 5. The determinant ls{xy ai, /3i) of this family is different from 
zero, not only at the point 1 but at every point of £ 12 , since the surface S has no 
focal point on £12 according to the condition IV'. For a sufficiently small value € 
the extremal arcs defined in the family by values x, a, j8 satisfying the conditions 
Xi — e<x<X 2 + €, |a — ai|<€, |j3 — /3il<€ 

will simply cover an open region F of ocyz-space. No two of them can, in fact, 
intersect when c is small, as one sees by an argument similar to that of Section 
15. Furthermore, the usual implicit function theorems applied to the equations 
y = y (^, a, /8) , z = z{Xy a, jS) 

show that every point (jc, y, z) which is covered by the extremals, has a neigh- 
borhood which is also covered, so that F is an open region. For sufficiently small 
values € no one of the extremals will intersect the surface 5 more than once, 
since £12 itself has this property by hypothesis. According to Theorem 20 - 1, the 
region £ is a field with the slope functions of the family, since the extremals of 
the family are cut transversally by the surface The value of the Hilbert in- 
tegral 7* with the slope functions y, z), q{x, y, z) of the field is zero along 
every arc L in F on the transversal surface S, 
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Hie inte^gral formula 

(57-3) /(Cm) - /(Em) = / £(*, y, *. P, ?. /, *') rf* , 

analogous to the formula (19* 1), holds for every admissible arc Cj 2 in F joining 
the surface 5 with the point 2. For, if Zis be an arc on 5 in F joining the points 1 
and J, then it follows, from the properties of the invariant integral /* of the 
field, that 

/ (C32) - / (£12) = I (C32) - (£12) 

-/(C 32 ) -/*ai8+C32) 

= /(C32) ~/*(C32), 

since /*(I.i 3 ) = 0. When the integrals for /.(C32) and l*(Cz 2 ) are substituted in 
the second member of the last equation, the formula (57 • 3) is obtained. The re- 
mainder of the sufficiency proof is a repetition of the reasoning used in the proofs 
of Theorems 16 1 and 16-2. 

58. Problems in three-space with one end-point variable on a curve. The 
problem of minimizing an integral 1 in the class of admissible arcs joining a fixed 
curve £ to a fixed point 2 in acys-space has been studied by Bliss and Mason* for 
the parametric case. The theory of the problem is complicated, relative to that 
of the last section, by the fact that the two-parameter family of extremals cut 
transversally by the fixed curve L does not simply cover a neighborhood of Z, 
namely, the points on the curve L itself. In the following paragraphs a method 
suggested by Hestenes, and quite different from that of Bliss and Mason, has 
been adopted for the non-parametric case. It makes use of the results in the last 
section for the problem with one end-point variable on a surface. 

For the development of the theory of the problem here to be discussed, it can 
first be shown, by methods used in preceding pages, that for a minimizing arc 
£12 the conditions I, II, III described in the earlier paragraphs of this chapter 
must be satisfied. The transversality requirement in condition I follows from 
Theorem 57* 1 if in the proof and statement of this theorem we replace the sur- 
face S by the curve L. The curve L is supposed to be defined by functions 

{(a), 17(a), f (a) (a'^a^a"), 

which have continuous third derivatives and which are such that L does not in- 
tersect itself and has on it no smgular pomt. 

In order to formulate a condition IV for this problem with first end-point 
variable, we assume that the arc £» whose minimizing properties are to be stud- 

> **The Properties of Curves in Space Which Minimize a Definite Integral,” Transactions of 
tke^American Matkmatical Society, IX (1908), 440-66; “Fields of Extremals in Space,” Und.^ 
XL (1910), 325-40. 
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ied is a non-singular extremal arc cut transversally by L so that at the intersec- 
tion point 1 of £12 and L the condition 
(581) *0 

is satisfied. Furthermore, we assume that the function / is different from zero at 
the element (a;, y, z, y', 2 ') belonging to the point 1 on £ 12 . This condition im- 
plies that the curve L is not tangent to £12 at the point since equation (58 • 1) 
and the identity of the lines through 1 having the directions Jiy'iz' and {a-i/a* 
fa would imply that / = 0 at 1. Let us consider, then, a two-parameter family 
of extremals 

(58*2) y{x, a, /3) , z (x, a, jS) 

containing the arc £12 for values Xj a, 0 satisfying conditions of the form xi^ x 
^ X 2 , a — tto, P = Pot and such that each extremal of the family is cut trans- 
versally by the curve L at the point defined on the extremal by the value x = 
{(a). If the determinant A(x, a, /3) = yo.z$ — y^Za of the family is not identi- 
cally zero along £ 12 , a point defined on £12 by a zero of A{Xj ao, fio) distinct from 
Xi is called a focal point of the curve L on E 12 . The focal-point condition of The- 
orem 57 • 2 is then easily seen to be applicable to the present problem if in its 
proof and statement the surface S is replaced by the curve L. 

In order to prove the existence of a family of extremals (58 • 2) cut trans- 
versally by the arc L and having the properties described above, consider a di- 
rection l\m:n transversal to £12 at the point 1 and not in the plane determined 
by the tangents to L and £12 at that point. The equations 

- Hia + UTla+ Vfa= 0 , 

(58-3) 

— +ufn+ vn = P -- Pq y 


in which Pq is a quite arbitrarily selected constant and the arguments of the 
Hamiltonian function H are {x, y, z, «, v) = [{(a), 17(a), f (a), «, v], are similar to 
equations (20 *7) with the variables Xy y, z, y', z' replaced by the canonical vari- 
ables X, y, z, Uy V, They have the special solution (a, P, u, v) = (ao, /So, «i, Wi), 
where Ui and Vi are the values of /y# and /,» at the point 1 on £ 12 . At this solution 
the functional determinant of the first members of equations (58 *3) with re- 
spect to Uy V is 


(58-4) 


w — y'/ » — z'/ 


1 {« / 

y' 1 ?* m 

z' fa n 


since on the extremal arc £12 we have y' ~ £u, z' = H*. The last determinant 
is different from zero at the point 1 because at that point the three ratios (i : y' : z'), 
{ka'fia't^y {limin) determine lines which are not coplanar, according to the 
hypothesis made above. Consequently, the equations (58*3) have solutions 
u{a, P)y v(a, P) with continuous second derivatives near the values (ao, jSo) and 
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such that M(ao, jSo) = Uiy v(ao, Po) — Vi. The family of extremals obtained by 
substituting the initial values ((a), 17 (a), f(a), u(a, jS), v(a, 0) in equations 
(27 *4) is a two-paranieter family (58 *2) cut transversally by the arc L at the 
value X = f(a). 

If the Legendre condition III holds at the initial point 1 of the non-singular 
extremal arc E 12 , then the family (58 *2) has a determinant A(:c, ao, jSo) which 
vanishes at x\ but is different from zero near rci, so that there is no focal point of 
the curve L determined by this family on £12 near the point L To prove this we 
note first that the equations 

(58-5) 17 (a) =y[f(a), a, P] , f (a) = 2 [£(a), a, P] 

hold along the curve L, By differentiation with respect to a and P these equa- 
tions evidently imply that at the point 1 

y«= i7«-y'£«= ciy y/9= 0, 

Sa “ fo 2 “ C2i 2/J — 0, 


where Ci and C 2 are merely notations for what precedes them. By an application 
of Taylor’s formula to the column y^, of the determinant A(x, ao, /3o) it fol- 
lows that this determinant is the product of (x — xi) by a factor which reduces 


oXx — Xi to 
(58/6) 


p- y',\ 

I 2a 2 ' I 1^2 2 ' I 


Differentiation with respect to P of equations (58 • 3) with their solutions M(a, jS), 
t;(a, P) substituted gives the equations 

CiUfi+ 0, (w — £«/) (w — H®/) = 1 , 

which show that the derivatives are not both zero and that the determi- 
nant (58 *6) is Ufiyfi + except for a non-vanishing factor. But 
u (a, P) = /y' { € (a) , 1? (a) , f (a) , y' [ € (a) , a, /3] , z' [{ (a) , a, j5] } 
with a similar expression for v, so that 

Since vp are not both zero, it follows that this last expression is different from 
zero because of the non-singularity of £12 and the condition III, which is sup- 
posed to hold at 1. Evidently A(af, ao, Pq) is zero at z = xi but different from zero 
elsewhere near that value. 

It should perhaps be noted that the assumption of the validity of the condi- 
tion III at the point 1 on the arc £12 is more than is necessary to insure that the 
determinant A(x, ao, po) does not vanish near x ~ xi. It would suffice merely to 
assume that the particular expression (58*7) is different from zero at the 
point L 
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The following theorem is analogous to Theorem 57*3: 

Theorem 58-1. Let E12 be an admissible arc without corners in xyz-space, cut 
at a single point 1 by a non-singular curve L and such that the integrand function f 
is different from zero at the point 1 on E12. Then for the problem of minimizing the 
integral I in the class of admissible arcs joining the curve L to the point 2 the con- 
ditions 1, 11 r, IV' of the preceding paragraphs are sufficient for I(Ei 2 ) to be a 
weak relative minimum^ and the conditions /, //n, IV' with the non-singularity of 
Ei 2 are sufficient for I(Ei 2 ) to be a strong relative minimum. 

The proof of this theorem is an immediate consequence of Theorem 57 • 3 
and the following lemma: 

Lemma SSI. If a non-singular extremal arc E 12 is cut transversally at the point 1 
by a curve L, has i Oat the point 1, and contains no focal point of L, then through 
the curve L there is a non-singular surface S transversal and not tangent to E 12 at the 
point 1, and such that on E 12 there is no focal point of S. 

It is easy to see from the results of Section 57 that when this lemma has been 
proved the minimizing property of the arc En will be established. From The- 
orem 57 -3 it follows, in fact, that En will be a minimizing arc in the class of ad- 
missible arcs joining the surface S to the point 2, and hence also in the class of 
such arcs joining the curve L to 2. 

To prove the lemma we may consider the functions .4 (a, jS), B(a, jS), C(a, j8) 
defined by the equations 




A =/ 




in which I, w, n are the values appearing in equations (58 • 3) and the arguments 
of Hu, Hv,f are those associated with the solutions w(a, j3), v(a, ff) of those equa- 
tions. The surface defined by the functions 


(58*8) 


{0 (a, jS, t) = { (a) P) dp 

Vo (a, jS, c) = 17 (a) ^ 


fo (a, /3, c) = f (a) +c J' C (a, P) dP 


has continuous derivatives of at least the second order in a, P, c, contains the 
curve L for jS = Po, and is non-singular along that curve, provided that e 9 ^ 0 , 
as one may readily verify. The equations 

— H(oa + UV0a+ »foa= 0 , 

(58-9) 

11 A + uB + vC =0 

with the arguments (x, y, s, «, v) = ({ 0 , Vo, f 0 , w, v) are equivalent to equations 
(58 -3) when € =» 0 and therefore have the initial solutions (a, P, €, w, v) = [a, P, 
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0, u (a, jS), o(a, jS)] on which the functional determinant of their first members 
with respect to «, v is (58-4) and different from zero near the values (ao, /9o)> 
Hence these equations (58-9) have solutions «(a, /3, c), v(a, 0, c), with continu- 
ous derivatives of the second order and reducing to the solutions u{a, 0), v(a, 0) 
of (58 - 3) for « “ 0. When substituted with f oi vo, to from (58 - 8) as initial values 
in the functions (27-4), these solutions define a three-parameter family of ex- 
tremals 

(58-10) Y(.x,a,0,e) Z(x, a. 0, t) . 

When e ^ 0 this family is cut transversally at * = {o(tt, 0, «) by the surface 
(58 -8), on account of equations (58-9), in which 

(ofi—tA, ri(i0 = eB , tofi — ^] 

and when e = 0 it contams the family (58 - 2) cut transversally at * = {(a) by 
the curve L. As a result of these properties 

Y ({oi a. 0t «) = 70 (tt. 0,t), Z (fo, a, 0, e) = to(a, 0, «) , 

(58-11) 

F (x, a, 0, 0) =y (x, a, 0) , Z (x, a, 0, 0) = z (x, a, 0) . 

The determinant A(*, a, 0, e) — YJLg — YgZa for the family (58 - 10) has now 
the expansion 

(58-12) A(a:, aoi 0o, e) =A(x, ao, 0o, 0) -f-oA, (x, ao, 0o, ®«) 

( 0 < 0 < 1 ), 

in which the first term is the corresponding determinant for the family (58-2) 
and consequently different from zero on the interval xi< x ^ xt because of 
condition IV'.' Furthermore, by differentiating the first two equations (58 -11) 
and using equations (58-8) the value of A<(xi, ao, 0o, 0) can be shown to be the 
determinant (58-4), which is different from zero. Consequently, there will be 
positive constants h, k such that A,(x, ao, 0o, Ot) will be different from zero for 
Xi^ X <Xi + h,0 < I ej <k. Hence for an e satisfying 0 < | ej <k and giving 
eA« the same sign as A(x, ao, 0a, 0) on the interval xi<x < xi + h, the expres- 
sion (58-12) will be different from zero on xi ^ x < Xi k; and it will also be 
different from zero on the whole interval xi ^ x ^ xo if c is still further restrict- 
ed so that on the interval xi A ^ x ^ xa the second term in (58-12) has ab- 
solute value less than that of the first term. This establishes Lemma 58-1 and 
therefore also Theorem 58-1. 

The method used by Mason and Bliss for the discussion of the problem of 
the calculus of variations with first end-point variable on a curve L depends 
upon an anal 3 rsis of the properties of the field covered by the two-parameter 
family of extremals cut transversally by the curve L, a field which is not simply 
covered along the curve L itself. In the method of the present section the critical 
step is the construction of the surface S through the curve L, transversal to the 
extremal Ea and having no focal point on Eu, as described in Lemma 58 - 1. In 
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the following sections a more general theory of problems with variable end- 
points is given, depending upon a theorem due to Hahn. This theory is also ap- 
plicable to the more special problems of the preceding section and the present 
one. 

59. A more general problem with variable end-points. The problem to be 
considered in this and immediately succeeding sections is that of finding a 
minimizing arc En for a generalization of the integral I of equation (35 - 1) in 
the class of admissible arcs yi{x) (xi ^ x ^ X 2 ',i = 1, . . . , n) whose end-values 
xi, y(xi), X 2 , y(x 2 ) lie on an r-dimensional subspace S of the 2(n + l)-dimensional 
space of points (:ri, y*i, X2, yi2)- Such a subspace 5 may be defined by functions 
of the form 

(59*1) x,itu . . . ytr) i yuitu . . . ytr) (r = l,2; r^2n + 2) 

for which the parameters / = (/i, . . . , /r) range over a bounded closed set T of 
points The functions (59 • 1) are supposed to have continuous third derivatives 
on the set T, and the space S is supposed to be non-singular and not to intersect 
itself. 

Besides the generalization to (« + l)-space and the generalization of the 
manifold on which the end-points Xiy y{x^ and X 2 y y{x^ lie we may also modify 
our problem further by adding to the integral 1 a function of the coordinates of 
the end-points, so that the functional to be minimized has the form 

y(*i). Xi, y (xj) ] + /" 7 (*. y, y') dx . 

•'*1 

This is the form in which the most general problem without side conditions so 
far studied was formulated by Bolza.^ Since on the end-space (59 - 1) the func- 
tion g depends upon /, we may write J in the form 

(59-2) / = ^ (0 + / 7(^, /) dx . 

The problem is then to find, in a class of admissible arcs y,(x) (xi^ x ^ X 2 ; 
i — !,...,«) satisfying the end-conditions 

(59-3) X. = (/), yiix,) =yiAt) (5 = 1, 2; i == 1, . . . , n) 

for a value t in T, one which minimizes the sum J in (59 -2). The form (59 *2) 
for J was introduced by M. Morse® and is in some respects more convenient than 
that of Bolza, since the accessory minimum problem associated with it appears 
at once in the same form, as will be emphasized again in a later section. 

Since, by h)q)othesis, the end-space S does not intersect itself, an admissible 

* “tJber den anormalen Fall beim Lagrangeschen und Mayerschen Problem mit gemischten 
Bedingungen und variablen Endpunkten,” Mathematische Annalen, LXXIV (1913), 430-46. 

• ''Sufficient Conditions in the Problem of Lagrange with Variable End Points,” American 
Journal of Mathematiesy LIII (1931), 517-46. 



158 


LECTURES ON THE CALCULUS OF VARIATIONS 


arc C with ends on S determines uniquely the value / belonging to its ends. The 
value /(C) of the sum (59 -2) is therefore uniquely determined by C, and /(C) 
is the function which is to be minimized. As before, we consider a particular ad- 
missible arc JBi 2 with ends on S whose minimizing properties are to be studied, 
and for convenience we let = 0 (A = 1, . . • , t) be the parameter values of the 
point on S belonging to En. The assumption of the non-singularity of the space S 
is equivalent to saying that the matrix of derivatives of the functions (59 • 1) 
with respect to the variables k is of rank r everywhere in T, The function g is 
supposed to have continuous second derivatives in 7*. 

Since every admissible arc joining the ends of En has its end-values on the 
space 5, it is clear that if £12 is a minimizing arc it must satisfy all the necessary 
conditions for the problem with fixed end-points, described in Chapter IV. 
More specifically, it must satisfy the condition I of Section 35 as well as the 
conditions II and III of Weierstrass and Legendre. Moreover, if the arc £12 is 
a non-singular minimizing arc without comers, it is an extremal, and there can 
be no point 3 conjugate to 1 on £12 between 1 and 2, 

If the arc £12 is a non-singular extremal, it can be imbedded in a 2«-parameter 
family of extremals y(x, a, b) of the form (36 1) with properties analogous to 
those described in Theorem 7 • 1. It is evident that if £12 is a minimizing arc the 
value /(/, a, b) of the sum / taken from x\{t) to Xi{t) along an extremal y(rr, a, b) 
must have a minimum at the values (/, a, b) = ( 0 , oo, 5o) belonging to £12 in the 
class of sets (/, a, b) satisfying the equations 

(59-4) yi{x,{t) , o, b] =yiAt) {5 = 1 , 2; f = 1, . . . , ») . 

The problem of determining the minima of the so-called ‘‘extremal integral” 
/(/, a, b) is a problem in the theory of functions of a finite number of variables. 

If the ends of the non-singular extremal arc £12 are not conjugate, the func- 
tional determinant of equations (59 *4) with respect to the variables bi at the 
initial solution ( 0 , ao, 5o) is the determinant D(x 2 ^ x^ Oo, 5o) of equation (36 *7) 
and different from zero. Equations (59*4) therefore have solutions ai(/), &»(/) 
which have continuous partial derivatives of the second order near / = 0 and 
which further take the initial values a<(0) = Oio, bi{0) = 6 , 0 . The functions 

(59-5) Yi(x, t) =^yi[x, a (/) , 5 (/) ] [xi{t) ^x^x^iO] 

now define an r-parameter family of extremals containing the arc £12 for / = 0 
and having their end-points on the end-space 5, as indicated by equations 
(59-4). From what has been said above, it is clear that the function 
(59*6) /(/) =/[/, a(/), bit)] 

must have a minimum /(O) if £12 is a minimizing arc. 

60 . A sttflElciency theorem for the more general problem with variable end- 
points. The purpose of this section is to establish a sufficiency theorem for the 
problem with variable end-points described in Section 59. The proof of the the- 
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orem depends upon an important auxiliary theorem due to H. Hahn.* We con- 
sider an extremal arc Eq in {x, y)-space which is imbedded in an r-parameter 
family of extremals (59-5). The theorem to be proved is to be applied to the 
particular family of the last section; but it is also applicable to other families of 
extremals, depending upon arbitrary numbers of parameters and having similar 
continuity properties. The theorem to be proved is as follows: 

Theorem 60-1. The Theorem of Hahn. Let Eo be a non-singular extremal 
arc imbedded for parameter values th = 0 (h = 1, . . . , r) m an r-parameter family 
(59 • 5) of extremal arcs of the kind described in the last section^ having on it no 
point conjugate to its initial point, and satisfying the strengthened condition //n 
of Weierstrass. Then there exist a constant € > 0 and a neighborhood F of Eo in 
(x, y)-spacesuch that for every extremal arc E of the family (59-5) whose param- 
eters t satisfy the conditions jthl < t the inequality 1(C) > 1(E) is satisfied for 
every admissible arc C in ¥ joining the ends of E but not identical with E. 

If Eo satisfies the condition 111 instead of //n, then there exist a constant € > 0 
and a neighborhood Ri of the elements (x, y, yO on Eo such that the last sentence 
above is true with F replaced by Ri and (x, y)-space replaced by (x, y, y')-space. 

In the proof of this theorem we use the canonical variables x, yx, Zi = fy\. 
From the imbedding theorem as proved in Section 38, we know that the non- 
singular extremal arc Eq is a member of a 2n-parameter family of extremals 

(60-2) yiix, a, b ) , Zi{x, a, b) 

in which the constants at, bi are the initial values of the variables yi, z* at a value 
xq on the x-interval belonging to Eo. We know, also, from the results of Sections 
23 and 39 that, since Eq has on it no point conjugate to its initial point, there ex- 
ists a conjugate system of solutions Vik(x), of the accessory canonical 

equations along Eo with |i7u(x) | 5*^ Oon the interval Xi ^ x ^ X 2 belonging to 
£0. The functions (60-2) with 

(60*3) ai^ Y i (Xqi /) "I” fjik (^0) ttfc b{^ Zi (Xo> /) *1” ^xk (^0) 

then define an (r -|- n) -parameter family of extremals 

(60*4) yi-yi{x, t, a), Zi-Ziix, t, a)' 

which contains the family (59 *5) for a = 0. The family (60 *4) contains the arc 
£0 for values (x, /, a) satisfying the conditions xi ^ x ^ X2, tk = 0, a* = 0. The 
derivatives y to*, Zia*. are equal to rji*, ft* at x = Xo, and along £0 satisfy the canon- 
ical accessory equations (39-12). Hence these equalities hold along £0 for all 
values of X, and we have |yt*ajbl = hifcl 5^ 0 along £0. It is evident, then, from 
the usual implicit function theorems, that the first n equations (60 -4) have solu- 
tions o*(x, y, t) with continuous derivatives of the second order for values (x, y, /) 

* “Vber Variationsprobleme mit variablen Endpunkten,” Monatsht^te fUr Mathemalik und 
Physik, XXII (1911), 127-36. 
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with («, in a neighborhood F of the arc Eo in «y-space and with | A | ^ c when 
F and c are sufficiently small. The functions 

(60*5) Pi{x, y, 0 =y\[x, t, o(*, y, /) ] 

are the slope functions of the family. 

For every fixed set of values t with | A | ^ « the slope functions (60 ■ 5) form a 
field in the region F, according to the results in Section 37. This is a consequence 
of the fact that on the hyperplane * = *o the integrand /v<dyi of the Hilbert 
integral /* is the sum baiai, which from (60-3) and the relations 

expressmg the conjugacy of the accessory extremals tia, tik has the value 
bidCi “ [Zi (xo» 0 "f" f t4 (*o) a*] Vii (*o) dat 

d [Zi (xo» 0 *?*'i (*o) (*o) Vii (*o) ataj] • 

The Hilbert integral is thus independent of the path on the hyperplane x = xo 
and therefore has the same property in the whole region F. 

As has been seen in Lemma 16-1, the condition Uat satisfied by the non-singu- 
lar arc Eo implies the condition IlJlr which says that the inequality 

(60-6) E[x, y, p{x, y, i) , y'] >0 

holds for all sets (*, y, p, y') with (x, y,p)mN and (*, y, y') admissible and dif- 
ferent from (x, y,p).H the region F and the constant e of -the preceding para- 
graphs are suitably restricted, then the inequality (60-6) will hold for all sets 
(/, X, y, y') with (*, y) in F, |/a| < «,and (x, y, y') (», y, p) and admissible. 

Hie conclusion of the first paragraph of the theorem now follows by an easy ap- 
plication of the fundamental sufficiency theorem. Theorem 19-2. 

If the arc Eo satisfies the condition III, it will also satisfy III', since it is non- 
singular. With the help of Corollary 9 - 1 extended to (« -f l)-dimensional space 
it can now be seen that there will be a neighborhood Fi of the sets (x, y, y') on 
Eo and a constant c > 0 such that the inequality (60 - 6) holds whenever (x, y, y') 
^ (», y, ^) is in Xi and 1 4 1 < c. Then for each fixed t satisfying the last inequal- 
ity the region F with the slope functions p(^x, y, /) will be a field to which the 
fundamental sufficiency theorem is applicable if we define admissible sets (x, y, 
y) to be those interior to Fi. Thus the statement in the last paragraph of the 
theorem is proved. 

By means of the theorem of Hahn we may establish the following sufficiency 
theorem for both weak and strong relative minima for the problem with variable 
end-points described in Section 59. 

Theokem 60-2. Let Eo be an admissible arc without comers, with end-points on 
the end-space S and homing on it no other point of S. 

If Eo satisfies the si/^fident conditions for a strong relative minimum for the 
prtMem with fixed end-points, it is a member for 0 of an x-parameter famUy 
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{59 '5) of extremal arcs whose end-points are on the end-space S. If, further, the 
values J(t) in {59 S) of 1 == g + I these extremal arcs have a relative minimum 
a/ 1 =a 0 then J(Eo) is a strong relative minimum in the sense that there is a neigh- 
borhood F of Eo in xy-space such that the inequality J{C) ^ J(Eo) is valid for 
every admissible arc C in F with end-points on the space S. If the minimum of J(t) 
a/ 1 = 0 proper, the same is true of the minimum J(Eo). 

If Eo satisfies the sufficient conditions for a weak relative minimum for the 
problem with fixed end-points, then the statements in the preceding paragraph are 
still valid if we replace the word ^^strong^' by the word and the neighborhood 

Y by a neighborhood Ri of the values (x, y, y') on Eq. 

To prove the theorem we first note that, when £o satisfies the sufficient con- 
ditions for a relative minimum for the fixed end-point problem, the arguments 
of the last section show that it is a member of a family (59 • 5) of extremal arcs as 
stated in the theorem, and it also satisfies the hypothesis of the theorem of 
Hahn. 

We may next show that for every positive constant € there is a neighborhood 
F of £o such that for every admissible arc C in F with ends on the space S the 
parameter values h defining the ends of C on 5 are in the c-neighborhood of 
the values h = 0. Suppose that this were not true. Then there would exist a 
constant € > 0 such that every neighborhood (£o)i/m would contain an admis- 
sible arc Cm with end-points on the end-space S for parameter values (/)m not in 
the open neighborhood < €. The values (Om would have an accumulation 
point not in the neighborhood \th\<€] and the corresponding point of S would 
necessarily lie on Eo, since the neighborhoods (£o)i/m condense on Eo- But this 
is contrary to the hypothesis that Eo has no points in common with S except its 
end-points which correspond to the values h = 0. 

To complete the proof of the first part of the theorem let us consider a con- 
stant € > 0 and a neighborhood F which have the properties described in the 
theorem of Hahn. It is evident that if the Hahn theorem holds for a particular € 
and F it will also hold for every smaller pair. We may therefore choose e so that 
the extremal integral J{t) has 7(0) as its minimum in the neighborhood | 1 < € 
and we may choose F so small that, relative to c, it has the property described 
in the last paragraph. Then for every admissible arc C in F, with ends on the 
end-space 5 at the point with the parameter values h, we have 

J (C) -7 (£o) = [7 (C) -7 (/) ] + [7 {t) -7 (0) 1 
= [/(C) -/(/)] + [7(0 -7(0)]. 

On account of the theorem of Hahn the first difference on the right is positive 
unless the arc C coincides with the extremal arc Et of the family (59*5), whose 
ends coincide with those of C. The second difference is non-negative, since 7(0) 
is a minimum; and if 7(0) is a proper minimum it is positive unless Et coincides 
with Eo- Thus the first part of the theorem is proved. The proof of the second 
part is similar. 
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61 . The transversality condition. The purpose of this and succeeding sec- 
tions is the deduction of conditions which are necessary for a minimum for the 
general problem with variable end-points of Section 59, and which, when suit- 
ably strengthened, are sufficient for a minimum. These conditions are analogous 
to those of preceding sections in this and earlier chapters. The sufficiency proofs 
are intimately dependent upon the sufficiency theorem of Section 60. 

The first of the necessary conditions remaining to be deduced is the following 
one: 

Theorem 61 • 1. The Transversality Condition. An admissible arc E12 is 
said to satisfy the transversality condition for the problem with variable end-points 
described in Section 59 if the equation 

(6 M ) [ (/ - y'. A;) dx + fy^.dy^ l + dg^O 

is satisfied identically in the variables dth(h = 1, . . . , r) when the differentials dxg, 
dyis (s = 1, 2) are expressed in terms of the dth by means of the end-conditions 
{59 3). This is equivalent to saying that the equation {6T1) is satisfied by every 
direction dx*, dyia tangent to the end-space S at the intersection (xi, yn, X 2 , yi 2 )o of 
S and Ei 2 . If this condition is satisfied, we say that the space S cuts E 12 transver sally. 
It is understood that the arguments of f atid its derivatives in equation {61 * i) are 
the end-elements of E 12 . Every minimizing arc for the problem of Section 59 must 
satisfy this transversality condition. 

The proof is similar to the ones made for simpler cases in Sections 57 and 58. 
Let dxt, dyu {s = 1, 2) be an arbitrary direction tangent to the space (59 * 1) at 
the point P determined by the end-values of E 12 in the (2» + 2)-dimensional 
space S of points y *) . Consider a curve T defined on the space S by functions 
th{a) having continuous first derivatives near the value a — 0, passing through 
P for a = 0, and having the direction dx^, dyu at P. When the functions th{a) 
are substituted in the functions (59 • 1), two end-curves C and D are determined 
in «y-space, defined by two sets of functions x,(a), y»,(a) {s = 1, 2). Consider, 
now, the family of admissible arcs 

yi [^1 {(T) ^ X ^ X2 (fl) ] 

for which 

y<(*, a) »=yi(*) + {y«(a) -y<lxi(o) ] } — 


+ {y<2(o) -yi[*2(«) ] } 


x-xi(a) 

*2 (a) ■ 


The family contains for a = 0 the arc En whose defining functions are yi(x), and 
for arbitrary values a the end-pomts of its arcs are all on S. According to The- 
orem 8 • 1, the sum / = g evaluated on this family of arcs is a function of a, 

whose first differential is the first member of the equation (61*1). Hence if 
J(En) is to be a minimum, this differential must be zero on Eu, as was to be 
proved. 
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62. The second variation and a fourth necessary condition. For the purpose 
of deducing further necessary conditions for the general problem with variable 
end-points it is desirable to compute the value of the so-called ‘‘second varia- 
tion” of the sum / = g -f 7 along an arc En whose minimizing properties are 
to be investigated. It is assumed that £12 is an extremal cut transversally by 
the end-space 5. We may consider now what would happen if we had a one- 
parameter family of admissible arcs defined by functions 

(621) Viix, a) [a:i(a) ^ IT g a[:2(<i) ] 

containing the extremal arc E12 for the value a = 0, and satisfying the end-con- 
ditions (59 • 3). The analytic expression of these properties is that there would be 
a set of functions tk{a) with 4(0) = 0 such that if we define the functions x^ia) 
by the first of the equations 

(62*2) X, (a) = Xt [t (a) ] , y*- [x» (a ) , a] = y*. [/ (a) ] , 

the remaining ones will also be satisfied. The functions yt(jc, a), yxx{x, a), 4(0) 
will be supposed to have continuity and differentiability properties sufficient to 
carry through the following computations. The existence of such a family will 
be established in the proof of Theorem 62 *1. 

After differentiating the identities (62 * 2) with respect to a and setting a = 0, 
one sees that 

(62-3) Vi(^«) —CxshTh {s — ly 2; i = 1, . . . , n) , 

provided that Vt(x), ta, and are defined, in accordance with the notations 
of M. Morse, by the equations 

ViM =yxa(Xy 0) T^ = /'(0), 

= y (0) - y\ (a:.) (0) (A = 1 , . . . , r) . 

In these equations the subscript h on x, and y,, indicates partial derivatives 
with respect to 4, the functions yx{x) are the ones which define the arc £12, and 
the repeated index s, by exception, does not indicate a sum: The set 17, (x), ta will 
be designated as the variations of the family (62 • 1) along the arc E12. The differ- 
entials associated with this family satisfy the relations 

^yi^yiaixy a) da, b-yi^y,aa{x, a) da^ 

(62*4) dyi^y\dx + byi , 

d^yi — y\d!^x y\^ dx'^ 2 by\dx + b^yi , 

where the symbol b indicates a differential taken with respect to the parameter a 
alone, and the symbol d a differential with respect to a occurring explicitly and 
also in x. 
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Along the arcs of the family (62 ■ 1) the values of the sum J define a function 
/(a) whose first differential clearly has the form 

dJ^J'ia) da^fdx\\+ r’l/,<«yi + Ai«y;]d* + dg. 

*'*1 

Along the extremal arc Eu where a — 0 an integration by parts shows that 
/'(O) is zero, since this extremal satisfies the transversality condition (61-1). 
The second differential d*/ = 7"(0)do* is expressible in the form 

d*/ *= (/d** + /'d** + 2/, .6y<d* + 2fy‘^by\dx ] ! + d*g 

+ + Sy, Sy')]dx 

*1 

where 2« is the quadratic form (36 *4). The notation /' is used to represent the 
derivative of / with respect to the x occurring in all of its arguments x, yt{x, a), 
y'i(x, a). If the usual integration by parts is applied to the first two terms under 
the integral sign in the expression for (PJ and if the 5-differentials are eliminated 
by means of the relations (62 *4) with a = 0, it turns out that 

dU^[fPx+ {Pyi-y[Px)fy*y^ + d^g 

+ [(/.-y;/»i) dx^ + 2fy,dyidx]] + f^y^dx . 

If the identities (62-2) are differentiated with respect to the parameter a, one 
sees that at a = 0 

dXf — Xti,dt\ d^Xt = Xihidihdti x^d^tii , 

dy< = yi,hdth , (Pyt = yMidthdu + yi,kd% . 

It is to be understood that the subscripts h, I on the function g, as well as on x„ 
y,„ will indicate partial derivatives with respect to and h. With the help of the 
transversality condition (61 • 1) and the relation 

d*g = ghd% + ghidtkdt 

it is seen that the terms in d*/* disappear from d^/ and that J"(0) is equal to the 
expression 

(62-5) Jtiri, t) = fc*iT*Tj+ /" '2«(#, ri, r)') dx , 

where hu = is defined by the equation 

+ [ (/ ~ y'Jy'^ Xhl + fyiVihl] * 

+ [ (/, - y'Jy^) xhXi + fti (ytkXi + yuXk) ] \ . 

The expression r) is called the second variation of the sum J along the ex-- 
tremal arc £ 12 . 
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A set of admissible variations riiix)^ n (^i ^ ^ ^ ^ 2 ) is one for which the param- 
eters Th are constants and the functions rii(x) define a continuous arc in opiy-space 
consisting of a finite number of sub-arcs having continuously turning tangents. 
For such sets of variations we can now prove the following necessary condi- 
tion: 

Theorem 62 • 1. yin extremal arc E 12 cut transversally by the end-space S is said 
to satisfy the condition IV if the second variation J 2 (t?, t) along E 12 is non-negative 
for every set of admissible variations i?i(x), tu satisfying the end-conditions (62-3), 
Every minimizing extremal arc E 12 is necessarily cut transversally by the end-space S 
and satisfies the condition 1 V. 

To prove this, consider a set of variations identical on the interval X 1 X 2 with 
the variations n of the theorem but with functions r|^{x) defined and 
having continuous derivatives rii(x) in neighborhoods of Xi and X 2 on an exten- 
sion of the interval X 1 X 2 beyond these values. It will be proved, in accordance 
with a suggestion of Hestenes, that these extended variations belong to a family 
(62 1) with continuity and differentiability properties sufficient to carry 
through the operations involved in the computation of the second variation. 
The function J{a) belonging to the family will then have first and second deriv- 
atives at a = 0, and if £12 is a minimizing arc it will also be true that J\0) 
= 0 and J 2 {Vi t) = /"(^) = 0- The theorem will therefore be proved. 

Let rjiXx) (v — 1, . . . , 2n) be a set of functions having continuous second de- 
rivatives on an interval including the interval X 1 X 2 in its interior, and such that 
the determinant 


(62-7) 


ViAxi) 

(^2) 


is different from zero. Then the 2n equations 

(62-8) y,-, (/) ^yilxAD ] (/) ] +B,ifiiy[xAt) ] 

(i’ ~ 1, ..., n y s “ 1 ) 2 y V — 1, ..., 2Ln'^ 


with th = an, are linear in the parameters By and have unique solutions By(a); 
and By{0) = 0 since the end-points of £12 lie on the end-space S for (/) = (0). 
These solutions By{a) have continuous first derivatives near a = 0, as can be 
seen by differentiating equations (62 -8). Furthermore, £i(0) = 0, since from 
(62 *8) differentiated and (62 -3) 

0= "-CuhTh + rnix,) + (0) =?;»>( a;,) £' (0) . 


At a » 0 the second derivatives BJ'(O) also exist, as one can prove by a second 
differentiation of equations (62 ■ 8) at a = 0. It will be seen that, because rji oc- 
curs only with a factor a, this differentiation can be performed at a = 0 even 
though the functions rii[Xe{t)] have only first derivatives. 

The family 

(62-9) yi{Xy a) -yi{x) +arii{x) +By{a) rjiy{x) 

[a(;i(a) ^x^X2{a)] 
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with 


h (a) = an , x, (a) = x, [t («) ] 


satisfies the end-conditions (62 *2) on account of equations (62 *8), contains the 
arc Ei 2 for a = 0, and has the set iy<(ap), n as its variations along £ 12 . It has, fur- 
thermore, the first differentials dyi{x, a) occurring in the equations (62 *4), and 
also the second differentials S^yi(x, 0). Hence the first differential dJ = J\a)da 
is well defined, and the second differential dU = J'\0)da^ will also exist, pro- 
vided that the function 

f{xAa), y[xAa), a], y'[Xs{a) , a]} 


occurring in the expression for dJ has a derivative at o = 0. But this function 
can be easily seen to have a derivative, since its first two arguments clearly have 
derivatives near a = 0 and since it follows from equations (62 • 9), with the help 
of the values Bv(0) = = 0, that at a = 0 

a] = y^ {x») x,k(0) th + ViM . 

The last equation (62 *4) is also satisfied at 0 == 0 by the differentials of the 
family (62 *9), as can be proved without difficulty. 

63. Further sufficiency theorems. The purpose of this section is the formula- 
tion of necessary conditions and of sufficient conditions for a minimum analo- 
gous to those of Sections 17 and 35 for problems with fixed end-points. In order 
to state such theorems we may designate by the Roman numeral I the corre- 
sponding condition of Section 35 plus the transversality condition of Section 61. 
These conditions are grouped together because they turn out to be naturally as- 
sociated in the multiplier rule, which will be proved in a later chapter for the 
more general problem of Bolza. For the fixed end-point problem, which is a spe- 
cial case of the general problem with variable end-points, the transversality con- 
dition imposes no restriction on the minimizing arc, as one may readily verify. 
The conditions II, III, and their associated conditions are the same as for prob- 
lems with fixed end-points. The second variation is said to be nan-negative if it 
satisfies the condition of Theorem 62 1, which we shall designate as IV. The 
second variation r) is said to be positive definite if it is non-negative and 
vanishes only when the variations rii{x), n are identically zero. The condition 
IV' is that / 2 (^, t) be positive definite. 

Theorem 63 * 1. For an admissible arc E12 without corners ^ with end-points on 
the end-space S and having no other pair of distinct or coincident points forming a 
point on S, the table of necessary conditions and of sufficient conditions for a mini- 
mum in Section 17 is applicable to the problem with variable end-points of Sec^ 
tion 59. 

The statements of the table concerning necessary conditions have been 
proved in the preceding sections. 
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The condition IV' of this section for the arc En of the theorem implies that 
there is no point conjugate to 1 between the end-points 1 and 2 or at 2 on the 
arc £i 2 . This is a consequence of the fact that if there were such a conjugate 
point the second variation / 2 () 7 , r) along £12 would vanish for a set of admissible 
variations (r;,*, n) = (»7i, 0) with functions r]i{x) vanishing at x\ and X 2 but not 
identically zero, as one sees by an argument similar to that immediately follow- 
ing equations (11*2). 

In order to prove the effectiveness for the problem with variable end-points 
of the sufficient conditions in the last column of the table of Section 17, we may 
first note, as a result of the comment in the last paragraph, that these conditions 
imply in each case the analogous conditions for the problem with fixed end- 
points. From the sufficiency theorem already proved, Theorem 60-2, it follows, 
then, that the extremal integral J{t) of that theorem exists. With the help of 
arguments quite like those of Section 62 it may be seen that at (/) = (0) 

dJ — dg-V [Jdx]\-\- f [Jy^^y^ + fv\^y[]dx 
dU=^J2i8y. dt). 

where 

^yi = y^h^th 52y, = y^t^^t^dthdtl . 


With the help of the usual integration by parts it follows that d/ = 0 as a result 
of the transversality condition. The condition IV' of this section implies, fur- 
ther, that the quadratic form dV in the variables dth is positive definite, so that 
/(O) is a minimum for the function /(/). The conclusion of Theorem 63 • 1 is now 
an immediate consequence of Theorem 60*2. 

64. Another form of the fourth condition. In order to state the condition 
which is to be discussed in this section, it is desirable to formulate for the prob- 
lem with variable end-points an accessory minimum problem analogous to those 
of Sections 11 and 36 for problems with fixed end-points. In Section 62 it has 
been shown that the second variation J 2 (ty, t) along a minimizing extremal arc 
must be non-negative. This suggests the accessory problem of minimizing 

f64*l) / 2 (i 7 , t) ==ftA/TAT/+ f *2w(ac, rj , rj')dx 

in the class of sets of admissible variations T 7 »(ac), n satisfying the accessory end- 
conditions 

(64-2) a[:,=const. (5 = 1,2), 

a problem in o^Ty-space of the same form as the original problem of Section 60 in 
«y-space. 

The accessory equations are, as before, the differential equations (36*5) of 
the extremals of the accessory problem. If 2n linearly independent solutions 



168 LECTURES ON THE CALCULUS OF VARIATIONS 

ritw(x) (v = 1, . . . , 2») of these equations are known, the general solution has 
the form 

(64-3) rii(.x) = c,iii,(.x) . 

The accessory transveisality conditions are the conditions 
(64-4) IC<.»«,;(*.)]? + 6»,T, = 0. 

They are the interpretation for the accessory problem of the transversality con- 
dition (61 • 1) for the original problem. 

In the discussion of the necessary condition which is to be formulated in this 
section it will be understood that the extremal arc £12 along which the second 
variation is taken is non-singular and cut transversally by the end-space S, 
Let X 3 be an arbitrarily selected value between Xi and X 2 ; and let Tjtfi (x, xs) 
(/i = 1, . . . , w) be a maximal set of linearly independent broken accessory ex- 
tremals which have no corner, or possibly one comer at xs, and which, further- 
more, satisfy the accessory end-conditions (64*2) with corresponding sets of 
constants = 1, . . . , w). We know that there is such a maximal set since 
the requirement that two families of the form (64*3) adjoin continuously at the 
value X 3 and satisfy the end-conditions (64*2) imposes only linear homogeneous 
conditions on their two sets of constants and their common set of constants 
Th- For the linear family of variations 
(64*5) , Th = afAThn 

the second variation belonging to the arc £12 is a homogeneous quadratic form 
(64-6) Qixz, a) a^r^) 

in the coefficients a^. 

It is evident that when the arc £12 is a minimizing arc the quadratic form 
(64*6) is necessarily non-negative. This form vanishes when the constants a,, 
define an unbroken accessory extremal (64 *5) satisfying the transversality con- 
ditions (64 *4). For in that case the formula 

2co = + 

and an integration by parts, together with equations (64 *2) and (64 *4), show 
that 

(64*7) t) = bhiThTi + [Ci,hThi»>^{ (Xs) ]*i = 0 . 

Furthermore, Q(x8, a) vanishes only for such values a^, since a set of admissible 
variations f;i(x), th which gives J2 the value zero is necessarily a minimizing set 
for the accessory problem. Since £12 is non-singular, the arc -rn^x) belonging to 
such a set can have no comers and must be an accessory extremal and, fur- 
ther, must satisfy the accessory transversality condition. 

The accessory end-conditions (64 -2) are said to be conjugate end-conditions 
if there exists a set of admissible variations rii{x), n, not identically zero, which 
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satisfy them and also the accessory transversality conditions and for which, fur- 
ther, the arc rii{x) is an accessory extremal. 

Theorem 64 * 1. Let E12 be a non-singular extremal arc cut transversally by the 
end-space S and satisfying the condition 111 of Legendre. 

Necessary and sufficient conditions for the second variation J2(>7, r) along E12 
to be non-negative are then that it have this property on all accessory extremal 
arcs with at most one corner satisfying the accessory end-conditions {64^2) andy 
further y that it vanish only on unbroken extremal arcs satisfying both the accessory 
end-conditions {64-2) and transversality conditions {64 4). Analytically y this is 
equivalent to requiring that for every value X3 between xi and X2 the quadratic form 
Q(x 3, a) in {64 6 ) be non-negative and vanish only when the constants a;, define an 
unbroken extremal {64' 5) satisfying the accessory end and transversality condi- 
tions. 

Necessary and sufficient conditions for J2 (t;, t) to he positive definite are the same 
as the above y with the additional restriction that the end-conditions be non-conjugate. 
This means that the positive definiteness of J2(r7, r) is equivalent to the positive 
definiteness of Q(x3, a) for every X3 between xi and X2. 

The necessity of the conditions in the theorem is a consequence of arguments 
which have already been made. 

To prove the sufficiency we note, first, that if there were a point conjugate to 
1 between 1 and 2 on £12 the second variation would vanish for a set [rit{x)y n] = 
[rii{x)t 0], for which rii{x) defines a broken extremal arc with one comer, as can 
be seen by an argument similar to that immediately following equations (11*2). 
This broken arc would satisfy the end-conditions (64*2) with th = 0, which 
contradicts the hypothesis of both parts of the theorem that Q{x 3 y a) vanishes 
only for values a defining an unbroken extremal satisfying the accessory end 
and transversality conditions. A similar argument shows that there is no point 
conjugate to 2 on £12 between 1 and 2 . Consider, then, an arbitrary set of ad- 
missible variations ri^{x)y th satisfying the end-conditions (64-2). Let X 3 be a 
value between Xi and X2. The ends of the arc 17, (x) on the interval xix^ can be 
joined by an accessory extremal, since Xi and x^ are not conjugate values, and 
similarly for the ends of r|^{x) on X 3 X 2 . Each of these extremal arcs gives to the 
integral in (64* 1) a value not greater than that given by the corresponding arc 
of T7»(x), as one may infer from the analogue for (« -f l)-dimensional space of 
Theorem 23 • 3. Consequently, the broken extremal arc which they form, taken 
with the constants r*, gives to J 2 a value not greater than that given by 
Th; and this value is non-negative on account of the hypothesis that Qix^y a) is 
non-negative. It is actually positive if the arc defined by yii{x) does not coincide 
with the broken extremal or if Q{x 2 y a) > 0. This last inequality will surely be 
true, when the end-conditions are not conjugate, unless the broken extremal and 
its values t are identically zero. Thus the theorem is proved. 

If one examines the proof of Theorem 64 - 1 which has just been made, it will 
be seen that the requirement of the theorem on the quadratic form Q{x 2 y a) can 
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be weakened. This requirement is that at every value xa between xi and xa the 
quadratic form Q(x 3 , a) be non-negative and vanish only for values a defining 
an extremal without comers satisfying the accessory end and transversality 
conditions. For Jaivy t) to be non-negative it is evidently necessary, and also 
sufficient, that these properties hold only at every value xa conjugate to xi or xa 
between Xi and xa and that Q have the property of being non-negative in any 
case at one point at least between Xi and xa. 

The criterion for the non-negativeness of the second variation given in the 
theorem is the outgrowth of a number of preceding results. In a paper concern- 
ing the problem of Bolza a fourth condition was formulated by Bliss^ in terms 
of a quadratic form similar to Qixa^ a) but involving only unbroken accessory ex- 
tremals satisfymg the end-conditions. For the problem of Bolza with so-called 
“separated end-conditions’* Hestenes® used, similarly, a family of accessory ex- 
tremals each with at most one comer and satisfying both end and transversality 
conditions. A condition similar to this for a problem in the plane with end-points 
variable on two curves was previously given by Bliss.’ The modification of 
these criteria described in the theorem, involving accessory extremals each with 
at most one comer and satisfying only the end-conditions, is due to Goldstine. 
This modification is more effective than the method of Bliss, for which the 
spread between necessary conditions and sufficient conditions for the second 
variation to be non-negative is greater than desirable in some not too excep- 
tional cases. It has the advantage over the criterion of Hestenes that it is ap- 
plicable even if the end-conditions are not separated. 

65. Focal points for problems with one end-point variable. The problem to 
be treated in this section is that of finding in the class of admissible arcs joining 
a fixed end-space 5 to a fixed point 2 one which minimizes the sum J in (59 • 2). 
The end-space S on which the first end-point only is variable is defined by func- 
tions 

(65-1) XlUu . . . , ir) y Vtliky • • • y k) • 

The purpose of this section is to formulate for this problem a condition, in terms 
of so-called “focal points,” equivalent to the condition IV of Section 62. As in 
the three-dimensional cases, discussed in Sections 57 and 58, focal points play 
the same role, for the problem just proposed above, that conjugate points do for 
problems with fixed end-points. 

It will be understood in this section that Eia is a non-singular extremal arc 
joining the end-space S to the fixed point cut transversally by this end-space S 

* *The Problem of Bolza in the Calculus of Variations,” Annals of Mathematics^ XXXIII 
(1932), 261-74. 

* '^Sufficient Conditions for the Problem of Bolza in the Calculus of Variations,” Transac- 
tions of the American Mathematical Society, XXXVI (1934), 793-818. 

^ “A Boundary Value Problem in the Calculus of Variations,” Bulletin of the American 
Malhemalkal Society, XXXII (1926), 317-31. 
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but not tangent to S at its intersection point L It will be understood, as in pre- 
ceding sections, that the point 1 is defined on S by the parameter values h = 0. 

The computations of Section 62 tell us that along £12 the second variation has 
the value (62 • 5) and that all the terms in the brackets of formula (62 • 6) vanish 
at the fixed end-point 2, It is evident that at the point 1 the accessory end and 
transversality conditions, (64-2) and (64*4), now have the form 

(65*2) >?i(3Ci) ^CiiTif (i(xi)Cih^ bhiTi (A = l, r) 

where 

f i “ ~ yiVi (^) (^ 1 ) • 

Since £12 is not tangent to the end-space 5, the matrix I|C,;,|| has rank r, as can 
be readily verified. At the value X 2 the end-conditions of the accessory minimum 
problem are simply X 2 = constant, rii(x 2 ) = 0, and there are no transversality 
conditions. 

In terms of the second variation the analytic definition of a focal point of the 
end-space S on the arc £12 is given in the following statement: 

Definition of a Focal Point. On a non-singular extremal arc £12 cut trans- 
versally at its initial point by an end-space S a value xz is said to define a focal 
point of S if the accessory equations along £x2 have a solution Ui{x) satisfying 
the accessory end and transversality conditions (65 • 2) at x = with constants 
TA, and having Ui{xz) = 0, but not vanishing identically on the interval XiXz. 

The following theorem gives an important criterion for the determination of 
focal points. It is a generalization of Theorem 12 *3 for the determination of 
conjugate points. 

Theorem 65 1. The totality of solutions of the canonical accessory equations 
{39 • 12) along E12 satisfying the accessory end and transversality conditions {65 • 2) 
with constants rh is a linear family whose basis is a system ?7ik(x), fik(x), rhk (k * 1, 
. . . , n) for which the accessory extremals i^ik, ^ inform a conjugate system. The focal 
points of the end-space S on E12 are determined by the zeros xs ^ Xi of the determi- 
nant lT?ik(x)l. 

To prove the theorem we may first note that every system 77 »a(x), iik{x) 
(fe = 1 , ...,») of linearly independent solutions of the canonical accessary equa- 
tions (39-12) satisfymg the accessory end- and transversality conditions (65-2) 
with constants rhk is necessarily a conjugate system. This follows because the 
expressions 17,* f *7 — f i* 17,7, shown to be constants in Section 39, vanish at « = xi^ 
as can easily be seen with the help of the conditions (65*2) with fi*, tja in 
place of r\iy f i, n. To show that there exists such a conjugate system we observe 
that the (n +• r) equations (65 • 2) have a maximal set of n linearly independent 
solutions 7iik{xi)y ^ik{xi), taa (ft == 1, . . . , w), since the matrix |lCiA|| has, by hy- 
pothesis, rank r. It is also easy to see that for these solutions the matrix \\riik{xi), 
fiifc(i«Pi)|| has rank », since the equations (65 *2) have no non-vanishing solution 
Th with values rii{x^, f »(«i) all zero. Well-known existence theorems 
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now tell us that the canonical accessory equations have n mdependent solutions 
Vik(x)y (ik{x) (A « 1, . . . , n) with the initial values rnkixi), 

Consider now an accessory extremal iii{x)^ fi{x) satisfying the conditions 
(65 • 2) with constants n- From what has been said above, it is evident that the 
initial values rfi(xi), ft(iJCi), n are linearly expressible in terms of the correspond- 
ing initial values at ic = of the conjugate system described in the last para- 
graph. It follows at once that for all values of x we have relations of the form 

(65-3) rii = riikak , f »• = , n = rw-fl* , 

in which the coefficients a* are constants. In particular, if rji[x) characterizes a 
focal point 3 of 5 on E 12 , then it is evident from the equations (65 • 3) that the de- 
terminant \yiik{x) I vanishes at ac = acs, since rii{Xi) = 0. Conversely, if this de- 
terminant vanishes at a value xz tA xi then the constants may 

be chosen so that the corresponding set rii{x), fi(rr), th in (65 *3), which satisfies 
the condition (65 • 2), has = 0. The functions fji{x) are not identically zero 
on the interval xixz since otherwise the functions f ,• = w,;- linearly expressible 
in terms of the i?* and rii would also vanish identically, and the system rjiky f 
(fe = 1, , . . , w) could not be linearly independent. Thus, every zero Xz A Xi of 
the determmant \riik{x) 1 determines a focal point, and the theorem is proved. 

For the proof of the next theorem below it is convenient to note that the con- 
jugate system i?<fc(»), {■<*(«), rwb (fe = 1, . . . , ») of Theorem 65*1 may be so 
chosen that 

ViliXi) ^Cil , ffl(Xi)C,ft= bhl Thl^ Shi 

(65*4) i?»>(^i)=0, f»p(^i)C,fc = 0 , = 0 

(A, / = 1 , . . . , f ; p = r + 1 , . . . , ») , 
as one may verify by an examination of equations (65 * 2). 

We are now in a position to state and prove the following theorem, which is 
the purpose of this section: 

Theorem 65 * 2. Let E 12 be a non-singular extremal arc cut transversally by the 
end-space S at the point 1 and satisfying the condition 111 of Legendre, Then the 
second variation (62 -5) along E 12 is non-negative if and only if the arc E 12 con- 
tains no focal point of the end-space S between the points 1 and 2. It isjurthermorej 
positive definite if and only if E 12 contains no focal point of S between 1 and 2 or 
at 2. 

To prove the necessity of the conditions in the theorem, we may first note 
that a focal point 6 of the end-space S on £» between 1 and 2 has associated 
with it, as in Section 11, a broken accessory extremal iAx), r* having a 
comer at Xz and no other comer. By an argument like that in Sections 11 and 
64 it follows, then, that 

t) « iMnri— f<(aCi)C,iT|*0 , 

since the extremal satisfies the conditions (65 ' 2). This result implies immediate- 
ly that, if the second variation is non-negative, there can be no focal point 6 be- 
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tween 1 and 2 and, further, from Theorem 64* 1, that if the second variation is 
positive definite the end-conditions cannot be conjugate. The end-conditions 
are evidently non-conjugate if and only if the point 2 is not a focal point of the 
end-space S on £ 12 . This completes the proof of the necessity of the conditions 
in the theorem. 

The sufficiency part of the theorem is the statement that, if there is no focal 
point of the end-space 5 on £12 between 1 and 2 or at 2 , then the second varia- 
tion is positive definite, and if there is no focal point between 1 and 2 , the second 
variation is at least non-negative. To prove the first of these statements let 17 , (x), 
Th be an admissible set of variations not identically zero and satisfying the end- 
conditions 

(65*5) (xi) (X 2 ) = 0 • 

We wish to show that for such a set the second variation is positive, and to do 
so we make use of the field in xij-space which is covered by the «-parameter fam- 
ily of accessory extremals in Theorem 65 • 1. If the functions rjaix) are those be- 
longing to this family, determined as in the paragraph preceding Theorem 
65 * 2 , the equations 

(65*6) riiix) ^akVihM +apTiipix) 

(A « 1, r; p = r+ 1 , n) 

will determine uniquely the coefficients a* as functions ajfc(x) on the interval 
xi < X ^ X2, since there is no focal point of S on this interval. We shall presently 
see that these functions have definite limits a^Cxi), ap{xi) as x approaches Xi and 
that Chixi) = Th- Furthermore, it is evident that a^Cx^) = ap{x 2 ) = 0, since 

I Vik(p^ I ^ 

The determinant \riih{xi), is different from zero. Otherwise there 

would be a set of constants C*, Cp, not all zero, satisfying the linear equa- 
tions 

+CX(*i) = 0 . 

Multiplication by Uiv'pi and summation with respect to * and j would 
give 

The last term would be zero, since = 0 and the solutions tm, i;ii and 
are conjugate; and the vanishing of the first term would imply Cj = 
C|C« AB 0 on account of the non-singularity of £u and the condition III. But 
this would further imply Cj — 0, since the matrix |IC«|1 is of rank r. The acces- 
sory extremal C, i}<pCx) would thus vanish with its derivatives at *i and be iden- 
tically zero , which is impossible, since the solutions iii„ (i, are linearly inde- 
pendent. 
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If the equations (65 - 6) are written in the form 

(65-7) + x^]d9 , 

it is evident from the result of the last paragraph that the functions oaCjt), {x — 
«i)ap {x) approach definite limits as x approaches xi. The numerator determinant 
of the quotient representing (x — xi)ap (x) will retain its value if the column 
fli(x) in it is replaced by the column rii{x) — th Vikix), all of whose elements van- 
ish 2 Ltx == xi on account of equations (65 • 2) and the initial values (65 * 4) . Hence 
the functions ap(x) also have unique limits at a; = xi. From equations (65*6) 
or (65 -7) at X = Xi, with (65 -2) and (65 -4), it follows that ah{xi) = Th» 

To prove that the value / 2 (»?, t) of the second variation is positive for the 
set rii{x)j Thj we may define Ui and r* for arbitrary values Xy xz on the interval 
XiX 2 by the equations 

(65-8) Ui(Xy xs) =ak(x3) riik(x) y Viix, Xz) ^ ak(xz) tik(x) 
and consider the function 

/ X, ^x, 

2Q)(Xy Uy U')dX+ I 2(a{Xy 7Jy 7)') 6 X y 

-I 

in which the argument of ah(x) is understood to be a; ~ xz. Since ah{xi) = r* 

and ^{(^ 2 ) = 0, this function has the values 

(65-9) 4»(aci) = 72(171 r) , 4>(:r2)=0. 

Furthermore, with the help of the usual integration by parts and the equa- 
tions 

Ui{xz, Xz) =^rii{xz)y u[{xzy Xz) +Uy^J,Xzy Xz) ^ri[(,Xz)y 

the derivative of at a point xz xi is seen to be 

<>'(X 8 ) = 2 i^,a^a;- [2uu^Vi]^- [2E„{Xy 17 , tt, 17 ')]*, 

where is the Weierstrassian ^-function for the function (a and the ir*(ap, 17 ) 
are the slope functions of the family (65-6). But from equations (65*8) and 
(65*4) it follows that 

[uix^Vi] ^ {xz) CikVi {xiy Xz) = a' (xz) hiai (x,) , 

so that the first two terms in ^^{x^ cancel; and after evaluation of the jB-func- 
tion we find 

^'(^s) “ ~ (17* ““ *•*) ] * . 

On account of the condition III and the non-singularity of the extremal arc £12 
this derivative is negative, and the values (65*9) show that 72 ( 17 , r) is positive 
unless the equations 17 / = jniXy 17 ) are ever 3 nvhere satisfied. But these equations 
are not satisfied, since their only solution through the point (x, 17 ) = 0 ) is 

the set 7ii(x) ss 0. Thus, using again the fact that the matrix ||Cijk|| is of rank r, 
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we see that r) is positive for every set rji{x)y r* satisfying the end-conditions 
(65 -5) and not identically zero. 

If there is no focal point of S between 1 and 2 on the a^rc £ 12 , then the second 
variation 72(»7, t) is at least non-negative. The proof can be made by an argu- 
ment quite similar to that used in proving Corollary 23 - 1. 

Theorem 65 ■ 2, which has thus been proved, shows that for problems with 
one end-point variable on a space S the conditions IV and IV', stated in terms 
of the second variation and used in Theorems 62 • 1 and 63 • 1, are equivalent to 
the non-existence on the intervals Xi < x < X 2 and xi < x ^ X 2 y respectively, 
of values defining focal points of 5. For the problem with one end-point variable 
these conditions are also equivalent, therefore, to those formulated in Theorem 
64 - 1 in terms of the quadratic form QC^a, a), 

66. Dependence of the focal point of a curve on curvature.® In this section 
w^e shall restrict the problem of the preceding section by supposing that the end- 
space S transversal to the non-singular extremal arc £12 at the point i is a curve 
L, Without loss of generality we may assume that the single parameter / in the 
functions (65 * 1) defining L, and in the function g(t) of the sum (59 *2), is the 
Euclidean length of arc on L measured from the point L It will be supposed that 
£12 satisfies the Legendre condition III and has on it no point conjugate to 1 
between 1 and 2, and that / 5 *^ 0 at its initial point L The numbers 

A = [f-yM'^ A- =[/.;]» 


are then not all zero and define a direction in a;y-space through the point 1 such 
that every other direction through the point 1 orthogonal to this one is trans- 
versal to the extremal arc £12 at 1. 

We shall be concerned with the totality of curves L tangent at the point 1 
to a fixed straight line through 1 transversal to the arc £ 12 . It will be shown that 
the focal point of L on £12 is completely determined by the projection a on the 
direction i4, £, of the first curvature 1/p of L at the point 1. When cr varies 
monotonically, the focal point moves monotonically on the arc £ 12 . 

The direction cosines of the fixed line through 1 transversal to £12 will be de- 
noted by a, fii. Since the single parameter t now appearing in (65 • 1) is the length 
of arc measured from the point 1 along £, we have a = Xh( 0), jS, = y»i<(0). If 
the angle at the point 1 between the principal normal of the curve L and the 
vector il, £i is designated by 0, then the projected curvature a of LvX the point 1 
is defined to be the value 


(661) 


cr 


1 

- cos 0 = 
p 


Axut-\-B^y^xii 


* See Householder, “The Dependence of a Focal Point upon Curvature in the Calculus of 
Variation” (diss., University of Chicago), in Contributions 1933-1937, p. 485, where other ref- 
erences to G. A. Bliss and M. B. White are given. 
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where the first curvature 1/p of L is, by definition, 

For the problem with initial end-point variable on a curve the second varia- 
tion from formula (62 * 5 ) has the value 

(66-2) r, a) - bT^+ f * 2 o){Xy rj, ri')dx, 

in which b is defined, from (62 *6), by the formulas 

b^gu- [A xut +Biynu] D = gu- a +BM , 

( 66 - 3 ) 

D^[(h-ylfy,) a^ + 2fy,afiiV. 

Furthermore, the accessory end and transversality conditions (65 • 2 ) now have 
the form 

( 66 - 4 ) f,- («i)C< = ir , 

where the constants 

(66 5) C.-i8..-y;.(^Ci)a 


are not all zero, since the curve L cannot be tangent to £12 when / is different 
from zero at the point i, as was seen in Section 58 . It is evident that the second 
variation J2 is completely determined by the projected curvature a and the vari- 
ations rti{x)f T when £12 and the direction cosines a, ft of the curve L at i are 
fixed. 

In order to study the dependence of the focal point of L on the parameter cr, 
we consider the family of accessory extremals rii{x, xs), ft(x, X3) determined by 
the conditions 

(66-6) X3)=Ci, i?»(x8, x«)=0, 


where xz is an arbitrary value on the interval xi < x < X2. For each such value 
Xs the corresponding extremal of the family is uniquely determined, since there 
is no value conjugate to Xi on the interval xi < x < X2. The value of the second 
variation /2(i?, r, a) on the set iy<(x, xa), t = 1 and on the interval Xi Xa is then 
a function of the form 


( 66 - 7 ) 


0 (X3, cr) -b+J' *2a>(x, 17, ri') 


b-Citi{xu Xz), 


the last equality being justified by the formula 2co = ruiam + 17^(0,;., the usual 
integration by parts, and the initial values (66*6). 
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Theorem 66* 1 . Consider an extremal arc Ew with the properties described 
above, and the totality of curves L transversal to E12 at the point 1 in a fixed direction 
a, jSi. The linear eqmtion in a 
(66-8) qixi, <r) =0 

is necessary and sufficient for a value xa on the interval xi < xs < X2 /(? define the 
first focal point on E12 of all the curves L whose projected curvatures are a. If xa is 
not conjugate to xi then the solution orCxa) of the equation {66 * 8 ) decreases monotoni* 
colly from +<» to its finite value a{x2) as Xa increases from xi to Xa. 

To prove the necessity of the condition (66 • 8) as stated in the theorem, let x^ 
define the first focal point of a curve L with projected curvature <r. There will 
then be a set t satisfying the end and transversality conditions ( 66 * 4 ) 
and the conditions r\i{^^ = Oj with r\\{^ belonging to an accessory extremal 
rii{x), fi(ic). The constant t cannot be zero, since there is no value conjugate to 
xi on the interval Xi <x < x^; and we may therefore suppose, without loss of 
generality, that t = 1 . The accessory extremal rit{x)y f i(^) is then identical with 
the rii{x, Xz), ^t{x, Xz) of equations (66-6), and the equation (66 *8) is a conse- 
quence of (66 * 4 ) and (66 * 7 ). 

Conversely, if the equation (66 • 8) is satisfied, the accessory extremal rii{Xy xz), 
f<(x, Xz) satisfies the transversality conditions (66 * 4 ) with r = 1 because of 
equation (66 • 7 ) ; and hence xz defines a focal point of the curve L with projected 
curvature <r. This focal point is the first one after xi since the solution €r{xz) of 
equation (66 *8) is finite and varies monotonically with Xz, as we shall see in the 
next paragraph. 

The partial derivatives of the function ^(^1:3, a) have the values 
( 66 - 9 ) - [fyoivWkV , - (A^+BA) ^2 , 

in which the functions rn are those in equations (66 • 6) . The first of these is found 
by differentiating the first equation (66 • 7 ) with respect to xz, applying the usual 
integration by parts, and using the fact that the derivatives vanish at Xi 
and have the values —rj' at xz, as one sees by differentiating equations (66-6) 
with respect to Xz- The value so determined is 

9*, = [2« — 2 77 ; 

and this is readily identifiable with the first expression (66 * 9 ), since the func- 
tions rii vanish at Xz. Because of the condition III and the non-singularity of 
£12, both of the derivatives ( 66 * 9 ) are negative; and the equation 

qx, + q^(r' (xz) =0 

shows that y{xz) is everywhere negative, so that <t{xz) monotonically decreases. 

To prove that the limit of a{xz) as xz approaches jci is + 00 , we may first se- 
lect an arbitrary set of admissible variations i?i(*), t with r 5^ 0 and satisfying 
the end-conditions 

Vt(Xi) = CiT , riiixz) =0 . 
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For every sufficiently large positive value 2) the second variation r, 2) is 
then negative, since the coefficient of tr in (66*3) is negative. Hence from The- 
orem 65 • 2 it follows that a curve L with projected curvature 2 necessarily has 
a focal point xz on the interval x\<x < xz, and from the first part of Theorem 
66 - 1 it follows that aCxz) = 2. This means that aixz) takes on arbitrarily large 
positive values, and that it must approach + » as 0:3 approaches Xi. 

In order to discuss the behavior of the function aCxz) as Xs approaches X 2 , 
when X 2 is conjugate to Xi, we shall need to distinguish between two cases. A 
transversal direction a, /Si at the point I will be said to be exceptional if for every 
accessory extremal U{x) with t;<(aci) = rit{x 2 ) = 0 the transversality con- 
dition ^i{xi)Ci = 0 is satisfied. We then have the following theorem: 

Theorem 66 -2. If xz is conjugate to xi and if the direction a, ft transversal to 
the extremal E 12 at the point 1 is not exceptional^ then the solution a(xz) of equation 
{66 -8) decreases monotonically from to — co as xz increases from Xi to X 2 . 

To prove this let 77 *, ft be an accessory extremal with elements rji vanishing 
at Xi and X 2 but not satisfying the condition ti{xi)Ci = 0. The set rji{x)j r = 0 
then makes Jzirij r, a) vanish whatever the value of a. But this set docs not mini- 
mize the second variation in the class of sets t;,, r satisfying the conditions 

rii{xi) —CiT , Hi{x^ =0 

since it does not satisfy the accessory transversality condition ix{xi)Ci = 0, 
Hence, from Theorem 65 • 2 it follows that for every curve L through the direc- 
tion a, at the point 1 there must be a focal value xz on the interval x\<x < 
X 2 - Since this is true whatever the projected curvature <r of L may be, it follows 
that (r{xz) must take on all real values for values Xz on the interval xi < x <X 2 , 
and the conclusion of the theorem can be justified with the help of Theorem 
66- 1. 

TifEOREM 66 -3. If X 2 is conjugate to xi and the direction a, ft is exceptional, 
then the solution cr{xz) of equation {66 8) decreases from to a finite value o-q 
as Xz increases from xi to X 2 . The value <ro is a projected curvature for which X 2 is a 
focal value of L on £12 as well as a value conjugate to xi. 

To prove this let ruifx), ixk{x) be the conjugate system of accessory extremals 
whose elements rnk all vanish at X 2 - If there are n — r linearly independent ac- 
cessory extremals with elements th vanishing at both xi and X 2 , the matrix 
lh<jb(ii:i)|| is of rank r and can be so chosen that it has zeros in its last » — r 
columns. On account of the conjugacy of the system it follows that 

=0 

(/i— 1, •••, r, V ^ r 1 , • • • , w) ^ 


and, since a, fii is exceptional, 


CifivCxi) =0 


(i^ « r + 1 , . . . , ») . 
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Hence the matrix Ct|| is of rank r. It is easy to see, therefore, that there 

is a set of constants t and a projected curvature cro such that the accessory 
extremal Vi = cLiXiv^ satisfies the conditions 

Ui (^l) ~“CxT , Vx (^l) Ci “ , tlx (^ 2 ) ” 0 » 

in which Jo is the expression (66 • 3) with (Tq in place of <r. The constant r does not 
vanish and can be chosen equal to unity. With the help of the well-known 
formula 

and the usual integration by parts it follows that 

/2(w, 1, <ro) = Jo — (aci) = 0. 

Since there is no value conjugate to x\ between x\ and x^y an extension to higher 
space of Corollary 23 1 implies that J^{Uy 1, cro) is a minimum of the values 
J 2 (rjy 1, cTo) in the class of admissible sets rix{x) satisfying the conditions 
(66-10) rii(xi) =Cx , Viix2) =0 . 

Hence, in particular for the broken extremal 17* on the interval X 1 X 2 , a part of 
which is rix(x, Xz), we have 

qixzj <tq) 1» (^o) ^A(«» 1» <^o) = 0. 

The value (r{xz) which makes q(xz, <r) vanish must therefore be greater than or 
equal to cro, since the coefficient of cr in q(x 3 f cr) is positive; and it follows that 
ff(xz) approaches a limiting value <ro ^ cro as 01:3 approaches X 2 . 

To prove that o-q = <^o we may use the facts that for the interval XiXz and the 
extremal Vi(x, Xz) the value J 2 IV, 1, <^(^3)] is zero and that, according to the ex- 
tension of Corollary 23 -1 cited above, this value is a minimum for the class of 
admissible sets jjx{x) on the interval XiXz satisfying the conditions 

Vi (^1) ~ , Vi (^2) ~ ^ • 

As a consequence of this fact, it can be seen that on the class of admissible varia- 
tions Vi satisfying on ^^1X2 the end-conditions (66 10) the function /2(»7, 1| <^o) 
has a minimum less than or equal to zero. Furthermore, by a continuity argu- 
ment similar to that used in proving Corollary 23 - 1 it follows that the minimum 
of J 2 (v, 1, CTo) cannot be less than zero. Hence JzCv, 1, o^o) has minimum zero, as 
was seen in the last paragraph to be the case for J 2 {v> This is possible only 
if CTo = cro. 

Theorem 66 • 4. If two directions are exceptionaly so is every linear combination 
of them. The number of linearly independent exceptional directions is r, when n — r 
is the number of linearly independent accessory extremals T7i(x), fi(x) with elements 
rii vanishing at xi and X2. 

To prove this we use now a, fii as direction ratios instead of direction cosines. 
The first statement of the theorem is an immediate consequence of the defini- 
tions of an exceptional direction and of the constant Ci in (66 - 5). In terms of the 
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notations used above^ necessary and sufficient conditions for a direction a, Pi 
to be transversal and exceptional are the n — r + 1 equations 

A a +BiPi = 0 , 

ixi) Ci = - y'. (xi) f (xi) a + f ,> (xi) /3< = 0 

(i^ =* f “t“ 1 , • • • ) w) . 

These equations are independent, since otherwise there would be constants 
not all zero, such that 

kA- kpy\ (xi) f (oPi) = 0 , 
kBi + kp^ip (xi) = 0 . 

If the last equation is multiplied by y^M, summed for i, and added to the first, 
it is found that A = 0, since the function / is, by hypothesis, different from zero 
at the point 1 on £12. The last equation would then be impossible, since the ac- 
cessory extremals tUp, f i, are linearly independent. Hence the number of linearly 
independent sets a, Pi defining exceptional directions is r. 

67. Problems with variable end-points in the plane. The problem in the 
opy-plane analogous to those of Sections 57 and 58 is that of finding in the class 
of admissible arcs 

( 671 ) y{x) {xi^x^X 2 ) 

joining a fixed curve L to a fixed point 2 one which minimizes the integral 

( 67 - 2 ) I = f 'fix, y, y')dx . 

The curve L is supposed to be non-singular and defined by functions 

1(a), 17(a) (a'^a^a"). 

For this problem an admissible ate Eu joining I, to 2 is said to satisfy the condi- 
tion I if it satisfies the equation 

( 67 - 3 ) /»' - fydx + c 

and if at the intersection point 1 of Eit and L the differentials dx, dy of L and the 
element {x, y, y') of E satisfy the transversality condition 

if - y'fw') dx -i- /,'dy = 0 . 

The conditions II and III are those stated in Section 35. 

To obtain an analogue of Jacobi’s condition for the problem, we suppose that 
£u is a non-singular extremal arc cut transversally bv the curve L at the point 1 
and not tangent to L at 1. By the method described for the problem of Section 
57 it is known then that a one-parameter family of extremals y(x, a) exists 
containing £u for values (x, a) satisfying conditions of the form xi ^ x ^ x*, 
a « ai and such that each extremal of the famUy is cut transversally by the 
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curve L at the value x = {(a). A focal point of the curve L on £ is then a point 3 
defined by a zero acs of the function y. (x, ai), that is, a contact point of E with 
an envelope of the extremal family. The arc En is said to satisfy the condition 
IV if there is no focal point of L on £12 between its end-points 1 and 2, 

With these definitions of the conditions I-IV the theorems implied in the table 
of Section 17 concerning necessary conditions and sufficient conditions for a 
minimum are again valid, provided that £ is an admissible arc without comers 
intersecting the curve L in a single point 1 but not tangent to L at 1. The proofs 
may be made by methods quite similar to those of Section 57, or the problem 
may be regarded as a special case of the more general ones discussed in Sec- 
tions 59-66. 

A second problem with variable end-pomts in the plane is that of finding in 
the class of admissible arcs (67 • 1) joining two fixed curves L and M one which 
minimizes the integral (67 *2). The curves L and M are supposed to be non- 
singular and defined by functions 

fi(a), nAa) (a'^a^a"), 

respectively. This problem is also a special case of those considered in Sections 
59-66, but there is an interesting interpretation of its condition IV which has 
been justified by Bliss.® 

An admissible arc £12 joining the curves L and M is said to satisfy the condi- 
tion I for the problem just formulated if it satisfies the equation (67*3) and if, 
furthermore, it is cut transversally by L and M at its intersection points 1 and 2 
with them. The conditions II and III are again those of Section 35. The second 
variation (62 *5) along a non-singular extremal arc £12 cut transversally by the 
curves L and M has the form 

^^ 2 ( 17 , t) =iiT®+ 52 T® + J 2(a{Xy 17 , dx , 

*1 

in which 2« is the quadratic form (36 *4); and from (62 * 6 ) 

61 = - [ (/-//,') i..+ h'Vaa+ Ux-y'Jy) + 2 ‘ , 

* 2 = [ (/ — //»') + Ux — y'fv) ki + 2fvifiVfiV • 

The condition IV of Theorem 62 • 1 requires this second variation to be non- 
negative in the class of admissible sets fi(x)y n, r 2 satisfying the end-conditions 
(62 *3), which now have the form 
(67-4) ff (xi) =Ciri , 71 (X 2 ) -C 2 T 2 f 

with 

Cl = riia — y' (^Pi) tia , C 2 = 172^ — y' {X 2 ) { 2 ^ . 

Criterion When Both End Points Are Variable,” Malhematische Annalen, 
LVIII (1904), 70-80, and ”A Boundary Value Problem in the Calculus of Variations,” Bulletin 
of the American Mathematical Society^ XXXII (1926), 317-31. 
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We shall assume henceforth that the extremal arc En under consideration is not 
tangent to either of the curves L and il/, so that the constants Ci and C 2 are 
both different from zero. We may further assume, without loss of generality, 
that these constants are both positive, since Ci, for example, can be made posi- 
tive by replacing a by —a if necessary. 

The focal points of the curve L on the extremal arc Eu are defined by the zeros 
of a solution u{x) of the accessory equation determined by the initial condi- 
tions 

(67 5) u{xi) =Ci , w, = 

as one sees from equations (65 • 2) and the subsequent definition of a focal point 
in Section 65. Similarly, the focal points of the curve M are defined by the zeros 
of an accessory extremal v(x) satisfying 
(67-6) v{x2)^C2, V, »')]^C 2 =— 62 , 

the negative sign in the last equation being due to the fact that the curve M is 
transversal to En at the second end-point 2 instead of L From Theorem 65 • 2, 
or by a direct proof like the one given for that theorem, it follows that when E 12 
satisfies the condition IV it can have on it between 1 and 2 no focal point of 
either of the curves L and M. The functions u{x) and v{x) are, therefore, both 
positive on the interval X\ <x <X 2 ^ The sets ri, 72 determined by the 
conditions 

ri (x) = « (jip) Ti on Xi^x^ X’i 


(67-7) = V (x) 72 on Xz^x^X 2 

u {Xz) 71 = (rrg) 72 

satisfy the accessory end and transversality conditions 

•n =^171 , T7 ( 0 : 2 ) =C272 , 


wv (^1) Cl = Jin , Wi|' {X2) C2 = 6272 , 


as may be seen with the help of equations (67 • 5) and (67 • 6) . Using the usual in- 
tegration by parts and simple manipulations, we find for such a set ri(x)y 7i, 72 
the value 


(67-8) /2(i7, r) = ri72[ (vw' — w»') /vVJ* • 


The third equation (67-7) shows that 71 and 72 have the same sign, so that the 
condition IV requires the bracket to be non-negative. The bracket has the same 
value at all points xz on the interval X 1 X 2 , smce, if we define J(ri) by the equa- 
tion ^ 


if follows that 


— [ (w»'~ vu') //„'] -uJ{v) - i;7(w) = 0 . 
We can now prove the following theorem: 
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Theorem 67*1. For a non-singidar extremal arc E12 in the xy-plancy cut trans- 
versally at 1 and 2 by the curves L and M, the condition IV j the non-negativeness of 
the second variation, is equivalent to requiring that L and M have no focal points on 
the arc E12 and that for the accessory extremals u(x), v(x) whose zeros define these 
focal points the expression vu' — uv' is zero or has the same sign as the product uv 
at each value X3 between xi and X2. The condition 1 is equivalent to the same re- 
quirements with the additional one that vu' — uv' 9 ^ 0. 

In the preceding paragraphs and here we are assuming Ci and C2 both posi- 
tive and u and v therefore positive at Xi and ac2, respectively. The statements of 
the theorem for the other cases follow readily from this one. 

If the condition IV is satisfied, the curve L cannot have a focal point at 2. 
Otherwise we should have u(x^ = 0, u\xi) < 0, »(ii(;2) = C2 > 0; and the ex- 
pression (67 -8) for the second variation would be negative. A similar argument 
shows that the curve M cannot have a focal point at Jf. The other consequences 
of the condition IV stated in the theorem have been justified above. 

It will next be proved that the requirements stated in the theorem imply the 
non-negativeness of the second variation and, consequently, the condition IV. 
This can be proved in a number of ways, but most simply, perhaps, by the use of 
a formula analogous to the formula (39 *5). Let r\{x)y n, t 2 be an arbitrary set of 
admissible variations satisfying the end-conditions (67 *4). We wish to calculate 
the value /2(^, r) in the third equation preceding (67 *4). Two values, wi and w^, 
may first be determined by the equations 

Wiuix^ -nix^ =W2V(^s) 

where u and v are the accessory extremals defined above and xz is an arbitrarily 
selected value between xi and xz- In Figure 67 *1, C12 represents the curve in the 
3t:iy-plane defined by y\{x), and £43 and £35 are the particular extremals wiu and 



WzV, respectively. If 1% represents the integral in 72(^7, r), then with the help of 
the properties of the invariant integral h* in the field of accessory extremals 
au it follows that 

^2 (C13) = 1/2 (C13) - It (C13) ] + /♦ (Lu) + 12 (^43) . 
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When the integrals on the right are evaluated with the help of equations (67 ■ 5), 
it turns out that 

ItiCit) = [»«,'(*, u, «')]*• 

•'*1 

If this value and a similar one for hiCa) are substituted in Jj(i 7 , r), it follows 
that 

(67-9) /*(»», t) = /* ’(ij'-T)*/,vd*+WiWj[ (»«'-«»') /vVl* » 

in which the slope functions of the two fields involved are t = riu'/u and t = 
iiv'/v on the intervals xixt and xtPCt, respectively. The remaining conclusions of 
the theorem follow readily from (67* 10). 

If there is an extension of the extremal arc £12 on which the curves L and M 
have focal points i' and 2', the results of the last theorem can be given a still 
different interpretation. The s 3 anbols 1' and 2' will always be used to designate 
the first focal points of L and M on the right or on the Ipft. 

TKeosem 67-2. If there is an extension of the extremal arc £12 on which the 
curves L and M have focal points 1' and 2', then the condition IV is equivalent to 
requiring that the arc £12 contain neither of these focal points and that the arc 11' 
of £ contain neither or both of the points 2 and 2'. The condition IV is equivalent 
to the same requirements, with the addition that 1' and 2' are distinct. 

The case when 1' is the first focal point at the ri^t of 1 will be discussed in 
this paragraph. At the pomt 1' we then have « = 0, «' < 0. The relation vu' — 
mu' ^ 0 of Theorem 67 - 1 implies that v ^ 0. The focal point of if is conse- 
quently either coincident with 1' when » = 0 or between 1 and 1' when » < 0. 
With the help of these remarks the conclusions of the theorem follow readily 
from those of Theorem 67-1. The argument depends upon the well-known facts 
that a solution u{x) 0 of a linear differential equation of the second order 
never vanishes with its derivative and that two such solutions have zeros which 
separate each other. 

A similar proof can be made in the case when 1' is the first focal point of L 
at the left of 1. 

For an admissible arc £12 without comers and having only its end-points 1 
and 2 in common with the curves L and M the theorems concerning necessary 
conditions and sufficient conditions for a minimum are those indicated in the 
table of Section 17 with the interpretations given in this section for the condi- 
tions I-IV. The proofs of the necessary conditions have been indicated in the 
preceding paragraphs, and the theorems concerning sufficient conditions are 
consequences of the theorems of Section 63. Independent proofe applicable to 
the problem of this section have been given by Bliss in the references cited 
above. 



PART II 

THE PROBLEM OF BOLZA 




CHAPTER VII 
THE MULTIPLIER RULE 

68. Introduction. The problems of Lagrange, Mayer, and Bolza with variable 
end-points are the most general problems of the calculus of variations involving 
only simple integrals for which a theory having some degree of completeness 
can now be formulated. The most economical theory of the three problems 
can be attained by studying, first of all, the problem of Bolza, since this type in- 
cludes the other two by means of simple specializations and since in each of the 
three cases the so-called “accessory minimum problem’^ of the second variation, 
which plays an important auxiliary role in the theory, appears as a problem of 
Bolza and requires no new theory for its analysis. 

Theoretically, the three problems are equivalent. It has been known for some 
time^ that the problem of Bolza can be transformed into a problem of Mayer. It 
is also true that there is a very simple transformation, apparently not hitherto 
noted, which takes the problems of Mayer and Bolza into problems of Lagrange. 
In Section 69, below, these relationships between the various problems are dis- 
cussed in more detail. 

The multiplier rule for the problems mentioned in the preceding paragraphs 
and the necessary conditions for a minimum usually designated as those of 
Weierstrass and Clebsch have been deduced by several writers.* A necessary con- 
dition analogous to those of Jacobi for simpler problems is suggested in early 
papers by Mayer [1, 3]; and a corresponding condition, stated in terms of the 
smallest characteristic number of a boundary-value problem associated with the 
second variation, was deduced by Cope [16]. The first published discussion of 
sufficient conditions is that of Morse [23] for the problem of Bolza. Shortly after 
the appearance of Morse’s paper Bliss [25] published a quite different theory of 
sufficient conditions, which he had long known and which was suggested by the 
papers of Mayer (to which reference is made above) and which involves an im- 
portant theorem due to Hahn [7i]. Still later, a third discussion of sufficient con- 
ditions for the problem of Bolza was made by Hu [31]. On account of rather 
strong assumptions concerning the so-called “normality of the minimizing arc,” 
the methods of these writers cannot be applied to the problem of Mayer with 
variable end-points without serious modifications. The method of Bliss has, 
however, been extended by Bliss and Hestenes [30, 34] to include this case. 

^ See Bliss [11], p. 314. The bracketed numbers here and in later references, as well as in the 
text, indicate items in the Bibliography of the Problem of Bolza, which appears on pp. 274-83, 
below. 

* See, e.g., Bolza [81; Bliss [11, 17]; Morse and Myers [24] ; Bliss and Schoenberg [26]; Mc- 
Shane [64]. 
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That the nonnality assumptions used by Morse and also by Bliss and Hes- 
tenes are stronger than necessary is shown in a number of recent papers. Graves 
[29] modified the proof of the necessary condition of Weierstrass in ingmiious 
fashion so that it is effective under much weaker normality assumptions; and 
McShane [64] has recently shown that this condition, like the multiplier rule, 
can be established without normality assumptions of any kind. Hestenes [38] 
generalized for the problem of Bolza a necessary condition for a minimum which 
had earlier been applied by Bliss [15] to a relatively very special problem in the 
plane. He further made a sufficiency proof for the problem which is remarkable 
in that it applies not only to normal extremal arcs which are not necessarily 
normal on every subinterval but also to some abnormal cases. The results of 
Graves and Hestenes bring to a satisfactory close the important chapter of the 
theory of the problem of Bolza which applies to minimizing arcs that are normal 
but have not necessarily the property of normality on every subinterval which 
was previously undesirably presupposed. The conclusions of the theory, estab- 
lished by the methods now possible in view of their researches, are applicable 
without modification to the problems of Mayer and Lagrange with variable end- 
points as specializations of the problem of Bolza. Such applications were not 
I>ossible for the earlier theories, as has been indicated above explicitly by Bliss 
[56]. 

The attention of students of the calculus of variations has been attracted to 
the so-called “abnormal” problems by two papers by Carath6odory [27, 36] in 
which he studies the theory of the second variation in such cases and makes a 
classification of abnormal problems. It seems unlikely to the present writer 
that a comprehensive theory without any normality assumption whatever can 
be developed in the near future, in view of the enormous variety of singular 
possibilities which present themselves. These singularities are analogous to 
those possible for implicit functions but are even more numerous and compli- 
cated. The results of Hestenes [38] did, however, extend our knowledge to spe- 
cial abnormal cases, and in the domain of sufficiency proofs they were the first 
such extensions. At the September, 1934, meeting of the American Mathemati- 
cal Society, Reid [44] announced an interesting auxiliary theorem which seems 
to make it possible to extend the method of Bliss for the problem of Bolza to the 
more general cases studied by Hestenes. Reid’s theorem has also been proved in- 
dependently by Morse [41] and Hestenes [46]. In papers by Hestenes [38, p. 794 ; 
50, p. 141] and Morse [41, p. 147] the statement is made that the hypotheses of 
the sufficiency theorems there proved contain no assumptions with regard to 
normality. This seems to me only superficially true, since the theorems in ques- 
tion are reducible in simple fashion to ones for normal problems, as has been 
shown by Bliss [56, p. 370]. Quite recently McShane [64, 69, 71, 76, 74] and 
Myers [77] have published interesting results concerning abnormal cases. 

Tbe purpose of the following pages is the presentation of the theory of the 
problem of Bolza in the relatively complete form which has been made possible 
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by the researches briefly described in the preceding paragraphs. The problem is 
formulated approximately as stated by Bolza himself, and the emphasis is on 
the normal problems, which seem to be by far the most important ones. The 
more Important results so far deduced for abnormal problems can be established 
by making the proofs for suitably phrased normal problems, as indicated in 
Section 92, below. 

69. The equivalence of various problems. The problem of Bolza [8] as 
formulated by Bliss [25] is that of finding, in a class of arcs 

(69-1) y%{x) (* = 1) »; *i^*^* 2 ) 

satisfying differential equations and cnd-conditions of the form 

(69-2) y, yO =0 (fl = 1, , 

(69-3) y(*i) . * 2 . y (*s) 1 =0 (m = 1, ^ 2« + 2) 

one which minimizes a sum of the form 

/ ^S 

y, y) • 

-I 

In these expressions, and often in the following pages, the symbols y and y' 
stand for the sets (yi, . . . , yj and (yf, . . . , yn) ; and similar notations will be used 
for other sets when convenient. In an analogous statement of the problem of 
Bolza by Morse® the conditions (69 *3) are replaced by parametric equations of 
the form 

(69*5) Xf — x^ (/) , y,’ (^#) — y»* if) “ 1; • • • > ^ 2) , 

where / is a set of parameters (/i, . . . , Qy and the function to be minimized is 
taken in the form 

(69*6) / = g (0 + Pfix, y, y') dx . 

The problem of Mayer with variable end-points as formulated by Bliss [11] 
is the problem of Bolza first described above with its integrand function f identi- 
cally zero, and the problem of Lagrange with variable end-points is similarly 
the case when g is absent from J in (69 *4). In Morse’s form it would be the case 
when g is absent from 7 in (69 *6). 

It is easy to see, with the help of very simple considerations, that the problem 
of Bolza is transformable into a problem of either Mayer or Lagrange with varia- 
ble end-points. It is, in the first place, equivalent, as one readily verifies, to the 
problem of Mayer having a class of arcs 

yiix), yn+i{x) (i = l, «; xi^x^x2) 

* [22], pp. 518-19; Morse and Myers [24], pp. 235-36. 
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subjected to conditions of the form 

<f>e = 0, y, y') *0, 

= 0 , y«+i (*0 = 0 

and having the function J — g-\- yn+i(*») to be minimized. It is, furthermore, 
equivalent to a problem of Lagrange with a class of arcs 

yn+i(^) (i=l, »; xi^x^x2) 

satisfying conditions of the form 

ifc. = o, y«+i(*i) -^ = 0 
and with an integral to be minimized of the form 

/ = /* (/ + 3'n+i) dx . 

*'*1 

This last transformation is the one mentioned above in the second paragraph of 
the introduction (Sec. 68). Since the problem of Bolza contains the problems of 
Lagrange and Mayer as specializations, it is evident from the two transforma- 
tions of this paragraph that the three problems possess the same degree of gen- 
erality. 

One can also verify, without difficulty, the equivalence of the problem of 
Bolza having non-parametric end-conditions and sum / of the forms (69-3) 
and (69 *4) with the one having parametric end-conditions and sum (69 *5) and 
(69 -6). For, in the first place, r = 2n + 2 — ^ new functions ^p^[xi, y(xi), X 2 , 
y{x 2 )] (A = 1, . . . , f) can be chosen in such a way that the 2n + 2 equations 

y(xi), X2y y(x2)] = 0 , 

}Pp+h[xu y(xi), X2, yM] =tH 

are solvable for the 2n + 2 end-values xi, y(xi), X 2 , y{x 2 ). Then the equations 
(69 *3) are expressible in the form (69 *5), and the function g in (69*4) becomes 
a function g(/). The choice of the functions \l/p^h is possible since in the develop- 
ment of the theory it is always presupposed that the matrix of derivatives of 
the functions with respect to their arguments xi, y{xi), X 2 , y(x 2 ) has rank p. 
On the other hand, a parametric problem is expressible as a non-parametric one, 
since the non-parametric problem with a class of arcs 

yi(»), hix) (i*l A = , r; « 2 ) 

satisfying the conditions 
^ = 0 

x,-x^[t{xi) ] =0 


(69*7) 


yx{x.) -yu[t{xi) ] =0 
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and with the function 

/ = [/(*i)] + /* ’/(*. y. y')dx 
•'*1 

to be minimized, is equivalent to the original problem with parametric condi- 
tions in the forms^ (69 5) and (69-6). 

The problem of Bolza associated with the expressions (69 1) to (69 -4) is 
said to have separated end-conditions if the conditions (69 • 3) fall into two groups 
of the form 

/fio y (*i) ] =0 , y (* 2 ) ] =0 

^ (P = l. .... r; (r = r + l, ... p) 

and if, furthermore, the function g in (69-4) is a difference 
g = gl[XU - g2[X2, y(x2)]- 

Otherwise the problem is said to have mixed end-conditions. The problem of 
finding, in a class of arcs 

(69-9) yi{x), yn+aix) 

(t=l, »; a = l, » + 2; Xi^x^X2) 

satisfying the different equations and end-conditions 
(x, y, y') = 0 , = 0 , 

iAp[y 2 «+i(*i)» yi(xi), y 2 n+t(xi), y„+i(xt)] = 0, 

(6910) 

y2«+i(*i) - *1 = 0 , 

y«+.- (*2) - y< ( xi ) = 0 , y2«+2 (*2) -*2 = 0, 

one which minimizes the functional 

/ (x, y, y') dx 

-1 

is a problem with separated end-conditions equivalent to the original one. 

Similarly, the problem of Bolza in the form (69 • 2, 5, 6) is said to have sepa- 
rated parametric end-conditions if equations (69*5) have the form 
X\ = Xi (/i, • • • , tp ) , y% (^i) “ yn {tij • • • > O » 

X2 =* X2 itp+if • • - , /f) , y» (^ 2 ) ~ y»2 (fp+l» • • • f ir) 

and if the function g{t) has the form 

g ^ gl ihf ••• 9 ip) g2 (^P+lf ••• 9 ir) • 

Otherwise the end-conditions are said to be mixed parametric end-conditions. 
The problem with a class of arcs 
(69- 11) yi{x)y yn-^h{x) 

(t “ 1, • • • , n* h ~ 1, . • • , Xi ^ X ^ ^ 2 ) 

* See Bliss and Schoenberg [26]. 
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satisf^ng differential equations and end-conditions 

= y'+* = o 

Xi =* Xi (lif • • • f ^r) f yi (^l) “ yn C^lf • • • > ^r) > 

(69-12) yn-hh(xi) 

X2 = (^+lj • • • » ^2r > y» (^ 2 ) “ y»2 (^r+1, • • • 1 fer) > 

yn+A(^2) =/r+fc> 

and with the function J in (69 • 6) to be minimized, has separated end-conditions 
and is equivalent to the original one [Hestenes, 38, p. 815]. 

When Xi and X 2 are allowed to vary, the end-points of an arc (69 • 1) describe 
a two-dimensional subspace of the (2n + 2)-dimensional space of points (jci, 
yiu ya)* An arc (69 * 1) satisfying the end-conditions of the problem of Bolza 
is said to satisfy the non-tangency condition if the two-dimensional locus of its 
end-points has no tangent in common with the space of (2» -1- 2)-dimensional 
points defined by the end-conditions of the problem. The tangents to the two- 
dimensional space have the direction ratios 

(^i) : 0-2 • ^2y\ (^ 2 ) » 


in which ai and 02 are arbitrary constants not both zero. Such a direction is 
tangent to the space defined by the equations (69 -3) if and only if for every 
/i = 1, . . . , ^ the condition 

ai + 3^^ (^ 1 ) ] +a2 [^m.2 + ^m.» 2 yj (a(;2) ] =0 


is satisfied, in which 



i\ 


^yi (:xJi) 


and 2 and ify,, ^ have similar interpretations. Hence it is evident that a neces- 
sary and sufficient condition for an arc to satisfy the non-tangency condition is 
that the matrix 


n y'i « K2+K i 2 Vi (**) II 


has rank two. A curve (69 * 9) satisf)dng the differential equations in (69 ‘ 10) has 
alwa}m this property relative to the end-conditions in (69*10), as one readily 
verifies. 

If the end-conditions are given in the parametric form (69*5), let us use the 
notations 

Xfh 1 yitii I y«« ■“ y«* (*«) > 

cmU) =yiA-yi,x,ii , 


(69*13) 


(t * 1, . . . , 


u; A = 1, . . . , r; « » 1, 2) . 
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Then the criterion for non-tangency is that the equations 


ilthTh = f 




have only the solutions (t», o,) = (0, 0). This is equivalent to saying that the 
matrix 


(6914) 


*1* 1 0 

ym yu 0 


iyiSA 0 y'JI 

has maximum rank or that the slightly tr 2 uisformed matrix 


Ci\h 

Cilh 


has the same property. An arc (69*11) satisfying the differential equations in 
(69 * 12) has slv/ays this property relative to the end-conditions in (69 * 12); and, 
furthermore, these end-conditions are non-singidar at the ends of the arc in the 
sense that the matrix of the first r columns of (69* 14) has rank r. 

The upshot of the remarks which have been made in the preceding para- 
graphs of this section is that the problems of Mayer and Lagrange with variable 
end-points and the problem of Bolza with either non-parametric or parametric 
end-conditions are all of equal generality and equivalent by relatively simple 
transformations. Furthermore, there is no loss of generality if it is assumed that 
the end-conditions are separated or non-singular or that the arcs satisfying the 
differential equations = 0 of the problem all satisfy the non-tangency condi- 
tion also. In the next section a more explicit description is given of the form in 
which the problem of Bolza will be studied in this book. 


70. Analytic formulation of the problem of Bolza. In order to secure a firm 
anal 3 d:ic foundation for the theories of the following pages, let R\ be an open 
region of (2« + l)-dimensional points (ac, y, y') in which the functions and / 
have continuous partial derivatives of at least the third order. In order that the 
equations = 0 may be independent differential equations, it is assumed that 
the matrix is ever 3 rwhere of rank m in Riy the subscripts yl here, as else- 
where, indicating partial derivatives. Similarly, R 2 is supposed to be an open 
set of (2» -f 2)-dimensional points (xi, yu, xz, yiz) in which the functions and g 
have continuous partial derivatives of at least the third order and in which the 
matrix 

(70*1) H^M.I ^m.2 

has rank p. In these statements the symbol yn, for example, stands for yi(«i) 
and 
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with similar notations for other derivatives. A set (jt, y, y') or (xi, ya, ^ 2 , ya) is 
said to be admissible if it is in Ri or i?*, respectively. 

The arcs of the type (69 * 1) to be considered are supposed to be continuous 
and to consist of a finite number of sub-arcs on each of which the functions 
y<(a;) have continuous derivatives. Such an arc is said to lie in Ri if its elements 
(^> Vi y') admissible, and it is said to be an admissible arc if its elements 
y) and [xij y(»i), » 2 , yixii] are all admissible. On every such arc the func- 
tions /, g and the function / have well-defined values. One should 

notice that this definition of an admissible arc is not that previously adopted by 
Bliss [17, p. 677] or that of Morse [23, p. 518]. The definition Sgiven here is de- 
signed merely to make sure that on each admissible arc C of the type (69 • 1) the 
sum J in (69*4) has a unique value /(C7). With the definition of admissibility 
formulated in this way it is believed that the statements of the various theorems 
are more self-contained. 

The problem of Bolza is now that of finding, in the class of admissible arcs 
(69-1) satisfying the differential equations = 0 in (69 *2) and the end-condi- 
tions = 0 in (6^ *3), one which minimizes the function J in (69 -4). If we 
designate by BT the class of admissible arcs satisfying <l >0 = 0 and = 0, then 
the problem is that of finding in Bt an arc C on which /(C) is a minimum. 

For the problem of Bolza in the form (69 *5-6) one may presuppose an open 
set T of points ^ = (/i, . . . , U) in which the functions a;,(0, yia(0» giP) bi the end- 
conditions (69 *5) and the expression (69 *6) for / have continuous partial de- 
rivatives of at least the third order. An admissible set 
(70*2) yi{x), th (i= 1, ...,»; A = 1, ..., r; xi^x^xi) 

is one for which the arc yi{x) is an arc in R\ and the set t is in T, The problem is 
then that of finding in the class of admissible sets (70*2) satisfying the differ- 
ential equations (69*2) and the end-conditions (69 *5) one which minimizes the 
sum J in (69*6). 

71. Variations and the equations of variation. In the following pages families 
of arcs with special continuity properties are frequently considered. In order to 
describe, once for all, the properties of these families, one may begin by defining 
an elementary family 

(7M) Vi{x b) {x^^x^y/^\ 1*1 

as one for which the derivatives y<(ir, b) exist and, with the functions yt(ap, 6), 
have continuous derivatives of at least the first order with respect to i in a neigh- 
borhood of the region of pomts (jr, b) indicated in the parenthesis on the right. 
Two such families are said to be adjacent if they are defined on adjoining inter- 
vals X* ^x^ X** and x** "^x^ and if, furthermore, their defining functions 

form continuous functions y<(jif, b) on the region x' ^ x ^ | A [ ^ €. The fam- 

ilies of the somewhat more general form 

(71*2) yi(x, b) [xi(b) ^x^X 2 (b); \b\<€] 



THE MULTIPLIER RULE 


195 


to be considered are supposed to have functions Xi(b), X 2 (b) with continuous de- 
rivatives of at least the tet order with respect to 6 in the region |(| < e, and 
each such family is supposed to be a part of a sequence of adjacent elementary 
families defined on adjoining intervals x'x", *'V", . . . , *<*“*>*<*> such that 

x' <Xi{b) < x” , < *2 ( i) < . 

A family (71-2) with these properties, all of whose arcs are admissible, is called 
an admissible family. 

The differentials of the family (71-2) will be represented by the notations 
dxi^Xadb, dXi — Xibdb 
(71-3) dyi == y.dx + Syi , 

(Py. ^ y'.Px + y'.dx^ + 2 6y[dx + 6^yi , 

where the symbol 5 indicates differentials taken with respect to the parameter b 
alone. Every admissible family has the differentials in the first two rows of 
(71-3), but not necessarily the differential since in the definition of such a 
family no assumption is made concerning the derivatives yj'. But we shall see 
later, in Corollary 77 *2, that admissible families with these differentials can be 
constructed when the arc E contained in the family (71 • 2) for J = 0 has con- 
tinuous second derivatives. 

The set of variations of the family along E is the set {i, ^ 2 , »?*(^) defined by the 
equations 

dx\ = (0) db = f 1 , dx 2 = X 2 h (0) db = Jc 2 db , 

(71-4) 

byi=yih{x, 0) db ^ rii(x) db. 

The symbols fi, {2 represent constants; and as a result of the continuity proper- 
ties of the family (71 *2), it is easily seen that the functions rii(x) have the con- 
tinuity properties of admissible arcs y^{x) on the interval a;i(o) ^ x ^ X 2 ( 0 ) 
belonging to E, Every set fi, { 2 , Vii^) with these properties is called a set of ad- 
missible variations along E. 

If the arcs of an admissible family all satisfy the equations 

y{Xy b) , y' (x, 6) ] = 0 

then the variations rii(x) along the arc E contained in the family for b = ioi 
satisfy the equations 

^71*5) ^fiiXy riy v') -^fiyiVi + <l>Pvi'n'i = 0 y 

in which the arguments of the derivatives of <l>fi are the functions yi{Xy bo) be- 
longing to E. The equations (71*5) are called the equations of variation along E. 
Their coefficients are completely determined, irrespective of any family, when 
the arc E is given. The repeated subscripts in these equations indicate that 
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sums for i » 1 , . . . , n are to be taken, as customary in tensor analysis. This 
convention is used freely in the following pages. 

If the end-values of the arcs of an admissible family satisfy the equations 
y[*i(i)i *]> xtib), d]}= 0 , 

then by differentiation with respect to & it is seen that the variations of the fam- 
ily along E satisfy the equations 

(71*6) V 9 >7 (^ 2 ) ] = 0, 

where 

(71-7) + + (K* + yi2t.i2'> 

and the arguments of the derivatives of are the end-values of E, The equa- 
tions (71-7) are called the equations of variation on E of the end-conditions. 

It should be noted that the definitions in the preceding paragraphs can be ex- 
tended at once to families (71*2) for which the symbol h stands for a set of pa- 
rameters ( 61 , . . . , 6 *). The family (71*2) will then have a set of variations 

for each parameter 6<r (<r = 1, . . . , 5 ). The modifications necessary in 
the differentials (71*3) and (71*4) can be readily inferred. 

72. A fundamental imbedding lemma. If an admissible arc E satisfying the 
equations = 0 , = 0 is given, one cannot conclude without proof that there 

are other arcs with the same properties. If there are no others, the problem of 
Bolza, described in Section 69 and formulated more precisely analytically in 
Section 70 above, is, of course, trivial. The lemma of this section establishes the 
fact that under the hypotheses made on the functions every arc E in Ri satis- 
fying the equations = 0 is imbedded in families of arcs of the same sort. In 
a later section it will be shown that every admissible arc E, satisfying the equa- 
tions of both the sets = 0 and = 0 , is imbedded in families of admissi- 
ble arcs satisfying these equations, provided that E has a certain property called 
“normality.” The following lemma plays an important role in this latter proof, 
as well as in other parts of the theory. 

Lemma 72*1. If an admissible arc E satisfies the equations = 0, and if (1, 
&, ^i(x) is a set of admissible variations satisfying the equations of variation ^0 = 0 
on E, then there is an admissible one-parameter family {71-2) of arcs containing E 
for the parameter value b » 0 , satisfying the equations ~ 0 , and having the set 
( 1 , ( 2 , t 7 i(x) as the variations of the family along E. 

In order to prove this it is desirable first to extend the system of equations 
» 0 to a system of the form 

(721) = = (/3 = 1, •••. m 7 = w + l, ..., »), 

in which the functions yy y') have continuous partial derivatives of at 
least the third order in a neighborhood of the values (x, y, yO belonging to E and 
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are such that the functional determinant of the functions with respect to 
the variables y' is different from zero along E, The functions 4>y can, in fact, be 
chosen linear in the variables y' with coefficients functions of as has been 
shown by Bliss [11, pp. 307, 312]. When the functions yi(x) defining E are sub- 
stituted, the equations (72*1) determine a set of functions Zy{oc) belonging to E 
which are continuous except possibly at the corners of E. Equations (72 • 1) have 
a set of equations of variation 

(72-2) ^>^ = 0, (/3 = 1, ..., f»; y^m + 1, ..., n) 

which determine functions corresponding to the arc E and the functions 
r\i{x) of the lemma. The functions are continuous except possibly at values 
X defining corners of E or at discontinuities of the derivatives of the functions 
mix). 

According to well-known implicit function theorems, the equations (72 - 1) 
have solutions 

y'i-Xtix, y, z), 

for which the functions Xt have continuous partial derivatives of at least the 
third order in a neighborhood of the values x, y, z belonging to £, since the func- 
tions 0^, 07 have such derivatives. Let x' be the first value following xi and de- 
fining a corner of £ or a discontinuity of the functions )7,'(x), or let x' = X2 
if there are no such values and let Ei be the sub-arc of E corresponding to the in- 
terval xix'. The functions Zy{x), iy{x) as defined on the interval Xix' can be ex- 
tended arbitrarily so that they are continuous on a slightly larger interval. The 
second members of the equations 

(72-3) yi = Xi[x, y, Z (*) (*) ] 

are then continuous in x, y, b and have continuous third partial derivatives 
with respect to the variables y,-, 6 in a neighborhood of the values (x, y, 6 = 0) 
belonging to Ei, According to known existence theorems for differential equa- 
tions,® equations (72 *3) have, therefore, solutions 

(72-4) y»= Fi(x, xo, yo, b) 

with initial point (xo, yo) = (^^o, yio, . - . , yno) for which the functions F*, F^ are 
continuous and have continuous partial derivatives of at least the third order 
with respect to the variables yco, 6 in a neighborhood of the sets (x, xo, yo, b) 
belonging to E. The functions 

(72 5) y.= F»[x, xi, y (xi) (xi) , J]=yt(^, b) 

then define an elementary family satisfying the equations 0^ = 0 on an interval 
including XlX^ 

* For these existence theorems see Appen., pp. 269-83, where further references are given. 
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The functions yi(xy b) defined in equations (72-5) have at Xi the initial val- 
ues 

Viixu b)-Yi[xu Xu y{xi)+bifi{xi)y 6] = (a:i) + Jiy*- (xi) , 

and hence their variations ytbixy 0 ) along Ei have at a; = rri the initial values 
rii{x^. Furthermore, the functions (72-5) satisfy equations (72 -3) and therefore 
also the equations 

4>0 — O, (l>y — Zy{x) +b^y{x) . 

Consequently, the derivatives yib{x, 0 ) satisfy the equations (72 *2) along Ei 
and must be identical with the variations rii(x)y since these functions are the only 
solutions of the equations (72 • 2 ) with the initial values ruixi). 

Thus an elementary family of the form (71-1) has been determined, defined 
on an interval including xix\ satisfying the equations = 0 , and having as its 
variations yihix, 0) along Ei the functions rii(x) of the lemma. Let x" be the next 
value following x* and defining a corner of £ or a discontinuity of the functions 
rii(x). It can similarly be proved that the functions 

= Yi[x, x'y y («', J) , b] , 

found from (72 *4) by substituting the initial values xo = x', yto = y*(x', b), de- 
fine a new elementary family on an interval including jr'oc", adjacent to the ele- 
mentary family previously determined on xix'y satisfying the equations = 0 , 
and having the variations iy<(x) along £. A continuation of this process provides 
an admissible one-parameter family (71-2) of arcs in R\ with the properties of 
the lemma. The functions ai;i(A) and ^^ 2 ( 6 ) may be defined by the equations 

Xi{b) =:ri + Afi, X 2 {b) ^Xz + b^z 

where Xi and x% are the end-values of £. 

Corollary 72 1. If an admissible arc E satisfies the equations 0 ^ = 0 and if 
iiaj £ 2 <r, ^ia(x) (o- = 1 , . . . , s) are s sets of admissible variations satisfying the 
equations of variation = 0 along E, then there is an admissible ^-parameter 
family {71-2) of arcs containing E for the parameter values ba == 0 (<r = 1 , . . . , s), 
satisfying the equations <t >0 = 0 , and having for each a — 1 , . . . , s /Ac 
i 7 ia(x) as the variations of the family along E with respect to the parameter b^. 

The proof of this corollary can be made by steps which are so closely analo- 
gous to those of the proof of the lemma that they do not need to be repeated 
here. 

The admissible families (71*2) constructed for Lemma 72*1 and Corollary 
72 *1 have more continuity properties than are required by the definition of an 
admissible family in Section 71. In the definition there given an admissible fam- 
ily is assumed to be one consisting of elementary families whose functions 
y<(^» yi i.Xy b) have continuous first partial derivatives with respect to b at 
6 » 0. The second members of equations (72*4) are continuous in x, yo> b 
and have continuous partial derivatives of the third order in the variables yo, b. 
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It follows readily, then, that the functions yi(x, b), y^Cx, b) of the elementary 
families constructed fqr Lemma 72 • 1 and its corollary will have partial deriva- 
tives of the third order with respect to the variables b continuous in a neighbor- 
hood of the values x, b defining the sub-arc of E contained in the family. With 
the help of a well-known theorem about interchange of order in partial deriva- 
tives® one can show that the order of the differentiations in one of these deriva- 
tives can be arbitrarily changed without changing the value of the derivative. 
These remarks will be useful in later paragraphs, especially in Section 77. 

73. The first variation of 7. When the functions defining an admissible one- 
parameter family (71*2) of arcs are substituted in the expression (69 *4) for /, 
a function J(b) is determined whose first differential is readily calculated, with 
the help of the notations (71*3), to be 

(73*1) dj ^ f dx\\ + dg + f ifyiby^ + fy\8y'.) dx . 

On the arc E corresponding in the family to the parameter value 6 = 0 this 
takes the form dJ = /i({, ri)db, where 

^73-2) /i(f, n) =/fl?+G[{i, >,(*,) a >?(*2)] 

+ /* ’ ( /»,’?. + /»;»» <) dx 

*'*1 

In this expression {i, {21 ^*(^) are the variations of the family along £, and G is 
the linear expression 

(73-3) G= + ^1 + giiVi(xi) + (g2 + yi2g^2^ ^2 + g,2'n^(x2) 

analogous to the function defined for in (71*7). The function /i({, 17) is 
called the first variation of J on the arc E, Its value is completely defined when the 
functions yi{x) {xi^ x ^ X 2 ) defining E and the admissible variations {1, fei 
rji{x) are given. 

The notation F will be used for the function 
(73-4) Fix, y, y', /) =/o/ + /.0, , 

in which the functions 0< are the set 0^, <l>y of equations (72 • 1), lo is a constant, 
and the multipliers /» are functions of x to be determined. If the variations rii(x) 
satisfy the equations = 0, and therefore also the equations (72 -2) with suit- 
ably determined functions f7(3c), the value of t?) will not be altered when 
the sum + ly{^y — ty) is added to its integrand function. From (73 -2) it 
is found then that 

Vi(f, V) + + f\F,,Vi+F,',v',-lyl:r)dx . 

It will be shown in the next paragraph that when constants loyCi{i 1, . . . , n) 
are arbitrarily selected a set of multipliers li{x)y continuous on the interval X 1 X 2 

• See, e.g., Pierpont, The Theory of Functions of Real Variables^ I (1905), 267, sec. 421. 
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except possibly at values * defining corners of E, is uniquely determined by the 
equations 

(73-5) F,'i= f\dx+Ci. 

By the usual integration by parts of the calculus of variations it follows then 
that 

(73-6) v) = Ihfi+Fy'.mVi + ldG- /\r7 dx . 

*1 

This is a reduced form for the first variation (73 • 2), valid for all admissible vari- 
ations {i, f 2 , v%{x) satisfying the equations = 0, and useful in proving the mul- 
tiplier rule of the next section. 

In order to prove that the equations (73 • S) have solutions as described in the 
last paragraph, the new variables 

(73'7) ^ ^ofy\ "f“ ^k4*ky\ 

may be introduced. It can be easily seen then that solution of equations (73*5) 
is equivalent to the determination of continuous functions Vi{x) satisfying equa- 
tions and initial conditions of the form 

(73-8) + 

in which the second members of the differential equations are found by solving 
equations (73 -7) for the U as linear expressions in Zo and substituting in the 
derivatives The coefficients Aiu^ Bi are functions of x continuous on X\X2 
except possibly at the values x\ x ”, . . . defining corners of E, Equations (73 • 8) 
have unique continuous solutions on the interval xix' with the initial values 
Vi(xi) = Ct and unique continuous solutions on the interval x'rr" with initial val- 
ues at x' equal to the end-values at x' of the functions already determined, and 
so on. The result is a set of continuous functions Vi(x) satisfying the conditions 
(73 *8) and defining solutions li{x) of equations (73 *5) by means of equations 
(73 -7). Because of the uniqueness of the solutions of equations (73 *8) for pre- 
scribed initial values Vi(x') at a single point x' it is clear that Zo, Vi(x) do not van- 
ish simultaneously unless they are all identically zero, and the same is true of 
the multipliers Zo, li{x). One can also verify readily the fact that the li(x) have 
unique right and left limits at values x' defining corners of E, since the deriva- 
tives yiix) for E have this property. 

74. The multiplier rule. Let us suppose now that £ is a minimizing arc for 
the problem of Bolza with defining functions 

yi(x) (f = l, ..., n; Xi^x^X 2 ). 

The so-called “multiplier rule” is a first condition which such an arc E must sat- 
bfy and which can be deduced as follows: 
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Consider a set 

(74-1) • {la, {2ri flioM (o’=l> •••> ?+l) 

of ^ + 1 sets of admissible variations all of which satisfy the equations of varia- 
tion = 0 along E. According to Corollary 72 • 1, there exists a (^ + l)-pa- 
rameter admissible family of arcs, 

(74-2) yUx, b) [xAb) ^x^X2(b)], 

containing E for the parameter values ia = 0, satisfying the equations =« 0, 
and having for each tr = 1, . . . , ^ + 1 the set {la, { 2 ^, Vw{x) as its variations 
with respect to b^ along E, When the functions defining this family are substi- 
tuted, the functions /, become functions J{b), \l/f,{b) of the parameters b. The 
equations 

(74-3) J(b) =7(0) +u, 4^Ab) =0 

have the solution (A, u) = (0, 0) corresponding to the minimizing arc E. If the 
functional determinant of the first members of these equations with respect to 
the parameters b is different from zero at the values (A, u) = (0, 0), then ex- 
istence theorems for implicit functions tell us that the equations have unique 
solutions bc(u) continuous near u = 0 and having the initial values Aa(0) = 0. 
But in this case E cannot be a minimizing arc for the problem of Bolza, since 
equations (74-3) show that the arcs of the family (74-2) corresponding to pa- 
rameter values A<r(w) with negative values of u give to 7 values 7(A) which are 
less than the value 7(0) belonging to E. It is evident, then, that for the minimiz- 
ing arc E the functional determinant of the first members of equations (74-3) 
must be zero for all choices of the sets (74- 1), 

From Sections 71 and 73, above, it follows that this functional determinant 
has the form 

7l({a, T7a) 

??ir) 

in which the arguments of have been abbreviated and only the second sub- 
scripts of the variations (74*1) are indicated. Let q < p + the maximum 
rank attainable for the determinant (74 *4) for sets of the form (74 - 1), and let 
(74 - 1) be a set for which this rank is attained. Then there exist constants /o, 
not all zero, satisfying the linear equations whose coefficients are the columns 
of (74 *4). With these constants the equation 

(74-5) Zo7i({, v) v) =0 

must be satisfied for every set of admissible variations { 1 , { 2 , mix) satisfying the 
equations of variation = 0 along E, Otherwise a set 7i({, 17 ), % ({, 17 ), not 
satisfying equation (74 • 5) and adjoined to the matrix (74 • 4) as a column, would 
give a matrix of rank p + ly which by the preceding paragraph is impossible. 


(74-4) 
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When the expression (73 -6) for /o/i is substituted in equation (74 *5), the re- 
sulting equation has the form 

[^o/f I + /0G + f ly^ydx^O. 

•^*1 

The terms outside the integral sign are linear in fi, {2, ^<(^^1), ^i(x 2 )] and the co- 
efficients of the terms in r^^(xl) can be made to vanish by properly selecting the 
arbitrary constants Fy\(xi) = Ct, used in (73 • 5). The remainder of the expression 
must vanish for every arbitrarily selected set fi, {2, ^y(x) with 

tinuous except at a finite number of values x on :ri:r2, since functions tyi^) and 
initial values ri^{x 2 ) determine, by means of equations (72 • 2), a set of admissible 
variations ri^{x). By setting {1 = {2 = VtO^) = 0 it follows from the equation 
above that the last n — m multipliers ly{x) (7 = w -f 1, . . . , vanish identi- 
cally, since the functions iy{x) can be arbitrarily selected. Similarly, by using a 
set for which 0, {2 = = ^7(0:) = 0, it follows that the coefficient of (i 

must vanish, and similarly for the other coefficients. Since lof = F when the 
arc E satisfies the equations = 0, this result is equivalent to saying that the 
coefficients of fi, £2, ^1(^2) in the equation 

(74-6) [FH +Fy>,riy, + W + e,^^ = 0 

all vanish. With the help of some further simple arguments given below, this 
justifies the following theorem: 

Theorem 74* 1, 'Ihe Multiplier Rule. An admissible arc E, defined on an 
interval X1X2, is said to satisfy the multiplier rule if there exist constants lo, not 
all zero, and a function 

(74-7) Fix, y, /, /) =/o/ + fe(«:) 0 = 1, m) 

with multipliers l^(x) continuous on X1X2 except possibly at values x defining corners 
of E where they have well-defined right and left limits, such that the equations 

(74-8) ^vx~ f Fy^dx + Ci, 0 ^ = 0 

are satisfied along E and, furthermore, such that the equations 
(74*9) [ (F — y^jPy^ dx ? + /o dg-\- Cud^n = 0 = 0 

hold at the ends of E for every choice of the differentials dxi, dyn, dx2, dyi2. For an 
arc E satisfying the multiplier rule the multipliers lo, 4(x) do not vanish simulta- 
neously at any point of the interval X1X2. Every minimizing arc E for the problem 
of Bolza must satisfy the multiplier rule. 

The vanishing of the coefficients of dx\, dyn, dx 2 , dy<2 in equation (74*9) is 
called the transversality condition on the arc E. It is equivalent to the vanishing 
of the coefficients of {1, i?i(xi), in equation (74*6), as can be seen by 
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noting that the equation (74 *9) is equivalent to (74 -6) by means of the trans- 
formation 

dxi ^ db , dy.^ = y'ndxi + rn (xi) db , 

dx2 = ^2 db , dy.^ = y[^dx2 + rji M db . 


By collecting the coefficients of dxi, dyiu dx 2 , dyi 2 in (74 9) one sees that the 
transversality condition is equivalent to saying that the matrix 

— i4i + /o^l + A2"{-hg2 5t2+/o^»2|| 

^^'m.2 ^M.i2 II 


(7410) 


has rank less than p + 1, where yli, Bn and ^ 2 , are the values at xi and X 2 
of the expressions 


This matrix form of the transversality condition was given by Bliss [17, p. 693] 
for the problem of Lagrange. The equivalence of the identity (74 • 6) with (74 *9), 
and the remark just made concerning the rank of the matrix (74 - 10), will be 
useful in later sections. 

The multipliers /o, lfi{x) cannot vanish at a point on X1X2, since otherwise the 
functions t;t(x) satisfying equations (73 • 5) and (73 • 8) would vanish simul- 
taneously and each would be identically zero, and the same would be true of the 
functions l${x). Equation (74 *9) would then imply = 0; and consequently 

= 0, since the matrix of derivatives of the functions is of rank p. This 
would, in turn, contradict the fact that the constants /o, in (74 -5) are not all 
zero. 

It is important for the sequel to note that every set /o, h{x ) , whatsoever, hav- 
ing the continuity properties described in the theorem and satisfying the equa- 
tions (74 ■ 8) and (74 • 9) along an arc £, must have all of its elements identically 
zero if /o, hix) vanish simultaneously or if /o, are all zero. In the former case 
the proof just given justifies this conclusion. In the latter case the identity (74 • 9) 
implies Fy^ = 0 at xi, with the consequent identical vanishing of Vi = Fy\, and 
equations (73 • 7) then show that lfi{x) s 0. 

The multiplier rule has the three following important corollaries analogous 
to those which are well known for simpler problems. 

Corollary 74* 1. The Euler-Lagrange Equations. At each point of an ad- 
missible arc E satisfying equations {74- 8) the functions Fyl have forward and back- 
ward derivatives^ equal except at corners and such that 

dFy^, 

(7411) 

Corollary 74-2. The Weierstrass-Erdmann Corner Condition. At each 
corner of an admissible arc E satisfying eqmtions (74 <?) the functions Fyl of 
Theorem 74' 1 have well-defined right and left limits which are equal. 
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Corollary 74-3. The Hilbert DiFFERENTiABiLiTy Condition. On each 
sub-arc of an admissible arc E satisfying the equations <l >0 = 0 and (74 • 8 ) on which 
the functions yi(x) defining E have continuous derivatives and the determinant 


(7412) 



4>ni 

0 


(i, i =» 1, . . . , n; P, y—l, . . 


tn) 


is different from zero, the functions yKx), 1^(x) belonging to E have continuous de- 
rivatives of at least the first order with respect to x. 

The first two corollaries are immediate consequences of the equations (74-8). 
To prove the third one let Xq be a value defining a point interior to a sub-arc 
of E on which the functions yi(a:) defining E have continuous derivatives, and 
let it be such that the determinant (74-12) is different from zero at xt,. The 
equations 

Py’Ax, y(x), U, /Ii] = / F„Jx, y(x), y'(x), I (x) ] dx ■+■ c, , 

•'*1 

<l>fi[x, y(x), u] =0 

have the solutions = yx{x), tifi = lfi{x) along the arc E. The functional de- 
terminant of the first members with respect to the variables «<, /x/s is the determi- 
nant (74 *12) and is, by hypothesis, different from zero at the values {x, y, y) = 
[ofoi yX^Co), /(oco)]. Hence, by well-known theorems concerning implicit functions 
the solutions Ui = y,'(x), will have near continuous derivatives 

with respect to x of as many orders as the functions in the equations have with 
respect to the variables x, Uy y. An examination of the equations shows that this 
number is at least one, and Corollary 74*3 is therefore proved. 

If the functions y^(x)y Ipix) have continuous first derivatives, the functions in 
the last equations have continuous partial derivatives with respect to the varia- 
bles Xy Uy y of at least the second order. Hence the solutions y,'(x), lfi{x) have con- 
tinuous second derivatives with respect to x near of = xo. A continuation of this 
process shows that when the functions f, <l >0 have continuous partial derivatives of 
order k with respect to the variables x, y, y' the functions yKx), l^(x) belonging to a 
minimizing arc £ must have continuous derivatives of order k — 1 with respect to x 
near every value xo defining a point on E at which the determinant {74' 12) is dif- 
ferent from zero. This is an extension of the results stated in the corollary. 

The properties described in Corollary 74 *3 and in the last paragraph hold 
even at corners of a minimizing arc E if one interprets them for one-sided deriva- 
tives only. To show this let xo be a value defining the initial element [xoy y{xo)y 
y{x^y /(oco)] of an arc on which the functions yi{x) belonging to E are continuous, 
and suppose that the determinant R in (74*12) is different from zero at this 
element. On an interval xo < x < x + e the derivatives yi'{x)y lfi{x) exist and 
are solutions of the equations 

= Fyi , 
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found by differentiating the equations (74-8) and = 0* These solutions must 
have unique limits 2 |,s x approaches xo from the right, since the functions y[{x)^ 
lfi{x) have that property. But the well-known mean-value theorem 

- =h'[x, + e(x-Xo)] (0<ff<l) 

X — Xq 

shows that a function A(x) having a derivative on an interval Xo < x < Xo-i- € 
will have a right-hand derivative at xq if h'(x) has a unique right-hand limit at 
X = Xo. Hence the functions yi(x), lfi(x) have right-hand derivatives at x = xo. 
By successive differentiations of the equations (74 • 8) and applications of the 
same reasoning the existence of the forward derivatives of higher orders can 
successively be established. 

There is an equation, indicated in the following theorem, which is sometimes 
useful and which should be adjoined to equations (74-8). So far as is known to 
the writer of these pages, it has not yet appeared in the literature. When the 
minimizing arc consists of a single arc on which the functions yKx), lfi(x) have 
continuous derivatives, this equation can be deduced from equations (74 • 8) or 
from equations (74*11). 

Theorem 74 *2. For every minimizing arc E there exist a function F and con* 
stants Ci, e^, with the properties described in Theorem 74' andy furthermorey a 
constant c stich that the equation 

(74*14) F-y[Fy*,^ fp^dx + c 

*'*1 

holds at every point of E. 

To prove this theorem the equations of the minimizing arc E may be written 
in the parametric form 

(74*15) X ^ t y yi ^ yi (i) (xi ^ t ^ X2) • 

A parametric arc of the form 

(74*16) X = X(0, yi^Vid) (/iS/^/ 2 ), 

with continuity properties in the space of points (/, X, V) like those described in 
Section 70 for admissible arcs in (x, y)-space, will be expressible in non-paramet- 
ric form if Xi(t) is different from zero on the interval /ife. If the elements (/, Xy V, 
Xiy Yt) of the arc (74* 16) are in a sufficiently small neighborhood of the set of 
similar elements of (74*15), then Xt wiU be different from zero along (74*16), 
and the elements (X, Y, Y^/Xt) and [X(/i), F(/i), X(^ 2 ), F(/ 2 )] of (74*16) will 
all be admissible. The arc (74 *15) must therefore minimize the sum 

J = g[X(ti), Y(h), Xih), Y(.h)]+fy[x, Y,^^X,dt 

in the class of such arcs (74- 16) satisfying the equations 
4«{x, F, ^)x, = 0, F(/j), F(fe)l=0. 
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A simple application of Theorem 74-1 now gives equations (74-8) and (74-9) 
for the arc (74- 15), as well as the new equation (74- 14) corresponding to the 
variable X. 

From equation (74' 14) one can deduce two results analogous to those of Cor- 
ollaries 74-1 and 74-2. At every point of a minimizing arc E the expression 
P-viFui has forward and backward derivatives, equal except at comers of E, 
and satisfying the equation 


(7417) 


diP-y'fyO 
dx 




and at each corner of E the function F -- yi Fy\ has forward and backward limits 
which are equal. 

If the arc E consists of a single arc without corners and with continuous de- 
rivatives %•', then equation (74* 14) is a consequence of the equations <f> 0 = 0 
and (74-11). For in that case the derivative of the expression F — yi Fy\ may 
be calculated in terms of y\\ 1 ^ and partial derivatives of F, and it turns out to 
have the value F,, as in (74 -17). Since F -y'iPvi is now continuous, the equa- 
tion (74 -17) implies (74- 14), a conclusion which cannot be thus directly made 
when the arc E has corners. 

In concluding this section attention should perhaps be called to the fact that 
the transversality condition (74-9) is equivalent to saying that the equation 

(74-18) [ (F-y^F,;) dx+Fy>,dyi\\ + k dg^O 

must hold for all sets of differentials dxi^ dyn, dx^j dy ^2 satisfying the equations 
The equivalence of the two statements is deducible from simple alge- 
braic properties of linear equations. Furthermore, the multiplier rule for the 
parametric problem as formulated by Morse takes the form stated in the fol- 
lowing corollary: 

Corollary 74 -4. Far the problem with parametric end-conditions in the form 

(74-19) Xg^ x^ (/i, • • • I /f) > y» ( ^a) ” y*» (^l> • • • > ^r) 

(i ~ Ij . . . , n\ 5 — 1, 2) 

there must exist for every minimizing arc E a function F of the form (74 -7) and 
constants ci such that the equations {74' 8) hold at every point of £ and, further- 
more^ such that the equation {74' 18) is satisfied identically in dth (h = 1, . . . , r) 
when the differentials dx„ dyi«/row equations {74-19) are substituted. 

This is an easy consequence of the application of the multiplier rule in The- 
orem 74-1 to the transformed problem described at the end of Section 69, 
above. 

75. The extremals. As has been shown in the preceding section, a minimizing 
arc for the problem of Bolza not only satisfies the equations 0^ » 0 but also 
must have a set of multipliers /q, lfi{x) with which it satisfies equations (74- 11). 
The solutions of these equations have, therefore, a special significance for the 
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problem. For this reason an extremal is defined as an admissible arc yi{x) with- 
out corners, together with a set of multipliers 

(75-1) yiix), /o=l, k{x) 

(f = l, w; j8 = l, m\ Xi'^x^X 2 ) i 

for which the functions y[{x), l^{x) have continuous derivatives on the interval 
X 1 X 2 and satisfy the equations 0/j = 0 and (74- 11). For such a set of functions 
(75 * 1) these equations can be written in the form 

(75*2) Fyjx "h P viv^yk 4“ P v^u^yk 4“ Pv\i}y ~ Pvi > 0^5 = 0 . 

They are called the differential equations of the extremals. An extremal is said to 
be non-singular if the determinant R in (74 - 12) is different from zero along it. 

Every non-singular extremal E is imbedded in a 2»-parameter family of 
such extremals. This can be seen most readily by introducing so-called canoni- 
cal variables (x, y, z) related to the variables (x, y, y\ 1) by the equations 
(75-3) Zi=^Py[{x, y, y', /), O = 0/j(:r, y, y') . 

The new variables z here introduced should not be confused with the auxiliary 
variables Zy of Section 72. Along E the equations (75 -3) define functions Zi{x) 
which have continuous derivatives on the interval xix^- The functional determi- 
nant of the second members of equations (75 -3) with respect to the variables 
y', I is the determinant R in (74- 12) and is different from zero along E. Hence, 
by the usual implicit function theorems there exists a neighborhood S of the 
values {xy y, 2) belonging to E in which equations (75 • 3) have a solution 
(7S-4) y. z)» y, z), 

reducing to y/(*), h{x) at each point [*, y{x), z{x)] belonging to E and such that 
the functions Pi, Lp have continuous partial derivatives of at least the second 
order, since the second members of equations (75*3) have this property. One 
can now readily verify the fact that in the neighborhood S every solution of the 
differential equations 

(75-5) yi=^Pi(xy y, 2), Zi^fy^ix, y, P, L) 

defines, by means of equations (75*4), functions Ip^x) with which it satisfies 
equations (75 -3) and therefore also equations (75 *2). 

Equations (75 *5) are in the canonical form to which well-known existence 
theorems for differential equations apply. These theorems say that equations 
(75 • 5) have through each initial point (opo, yo, zo) in S one and but one solution 
(75-6) yi=Yi(Xy Xo, yo, Zo), 2 <*Z, (a:, Xq, yo, Zo) . 

The functions Yi, Ft,, Z,-, Zt* have continuous partial derivatives of at least the 
second order in a neighborhood of the set of (2n + 2) -dimensional points (x, 
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* 0 , yof Zo) belonging to E, since the second members of equations (75 -5) have 
such derivatives. None of the solutions (75 * 6) is lost if the value xq is fixed, say 
at xi, since every one of them has an initial point with xq == Xi, With some 
further simple arguments it can now be concluded that for xq ~ Xi, and with the 
notations a, =» y»o, bi = Zio{i — 1, . . . , n), the functions (75 -6) define a family 
of extremals with the properties described in the following theorem: 

Theorem 75 1. The Imbedding Theorem. Every non-singular extremal E 
for a fixed value of the multiplier lo is imbedded for values 

(75*7) Xi^X^X2y GlO, GnO » ^10? •••> ^nO 


in a 2n-parameter family of extremals defined by functions of the form 
(75*8) y,(:r, a, 6), Zi{x, a, b), 


in which a, b are symbols for n-dimensional sets (ai, . . . , an), (6i, . . . , bn). The 
functions yi, yix, Zi, Ziz have continuous partial derivatives of at least the second 
order in a neighborhood of the values (x, a, b) defining E, and the determinant 


(75-9) 







is different from zero along E. In terms of the original variables x, y, y', 1 the ex- 
tremals {7 5 8) care defined by functions 

(7510) yi{x, a, 6), fc(x, a, 6), 

f dated to the functions {75 • 8) by equations (75 • 4) and (75 • J) and such that yi, yiz, 
1^ have continuous partial derivatives of at least the second order. 

The first part of the theorem has been proved except for the statement con- 
cerning the determinant (75*9). By differentiating the identities 

yio=F,(a:i, Xi, yoi «o), s,o=Z,(a:i, Xi, yo, Zo) 

with respect to the variables » y» 0 ) bi = bm it follows that the determinant 
(75 *9) is equal to 1 at the value Xi. In the next paragraph it will be shown that 
if this determinant is different from zero at one value of x^ it is different from 
zero everywhere on E. The properties of the functions (75 • 10) follow at once 
from preceding paragraphs. 

In the neighborhood S of points {x, y, z) described above, a function H(x, y, z) 
may be defined by the equation 

(7511) 

= ZiPi(x, y, 8 ) -Fix. y, P, L). 

From equations (75-3) and simple calculations it follows that the partial de- 
rivatives of this function have the values 
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These equations show that the function H has continuous partial derivatives of 
at least the third order and that equations (75 • 5) are equivalent to the system 

(7513) 


These are called the canonical equations of the extremals. The functions (75 *8) 
satisfy equations (75 *13) identically in rr, a, b. When they are substituted and 
the equations are differentiated with respect to Oky it is found that the deriva- 
tives y»afc, Ziah (i = li • • • > »)» when substituted for qi and f,*, are solutions of the 
linear equations 


(75*14) 


f t “ ^VtVk^k • 


Similarly, every other column of the determinant (75 *9) is a solution of these 
equations. But it is well known from the theory of linear differential equations^ 
that a determinant of 2n solutions of the equations (75 *14) is either identically 
zero or everywhere different from zero. Hence Theorem 75*1 is proved in its 
entirety. 

The function H and the canonical equations (75 *13) of the extremals have 
many applications besides that of the last paragraph. Let 5 be a region of points 
(^) y, a) so related to Ri that to every point (a:, y, z) interior to S there corre- 
sponds a unique solution {x, y, y', /) of equations (75 *3) with {x, y, y') in Ri and 
the determinant (74*12) different from zero. Equations (75*3) then have, as 
before, single-valued solutions (75 *4) with continuous second derivatives over 
the interior of S; and every solution y(x), z{x) of equations (75 *13) in this re- 
gion defines an extremal. Furthermore, if the region S contains all the points 
(x, y, z) corresponding to sets (x, y, y', 1) with (x, y, y') interior to Ri and satisfy- 
ing the equations <l>fi = 0, and with arbitrary values of the multipliers //j, then 
every extremal defines a solution of the canonical equations (75 * 13); and equa- 
tions (75 *2) and (75 *13) are equivalent. 

It should perhaps be remarked in closing this discussion of the extremals that 
the imbedding theorem can also be proved by applying the existence theorems 
for differential equations to the system 

( 7515 ) 

yixi), y'(xi)] =0 , 

equivalent to equations (75*2). The first two of the equations (75*15), solved 
for y{' and define a (2» -|- w)-parameter family of solutions which reduces to 
a 2n-parameter family when the last m conditions at x = xi are imposed. 

’ See, e.g., Gouisat-Hedrick, A Course in Mathematical Analysis^ II, Part II, 153-54. 
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76. Abnormality for minima of functions of a finite number of variables.^ 
The significance of the notion of abnormality in the calculus of variations 
can be indicated by a study of the theory of the simpler problem of finding, 
in the set of points y = (yi, . . . , yn) satisfying a system of equations of the 
form 

#(y) =0 (j 8 = 1 , . . . , 

one which minimizes a function /(y). For a point y® = (y®, . . . , yS) near 
which the functions / and have continuous partial derivatives of at least 
the second order, and which satisfies the equations = 0, we have the fol- 
lowing theorems, some of which are, of course, well known. 

Theorem 76-1. ^4 first necessary condition for f(y®) to be a minimum is that 
there exist constants lo, I/ 9 , not all zero, such that the derivatives Fyi of the function 

F = /o/ + 

all vanish at y®. 

To prove this we have only to note that the determinants of the matrix 

II II 


must all vanish. Otherwise, according to well-known implicit function theo- 
rems, the equations /(y) = /(y®) + u, 4>oiy) = 0 would have solutions y for neg- 
ative values of «, and /(y®) could not be a minimum. 

A point y® has, by definition, order of abnormality equal to q if there exist q 
linearly independent sets of multipliers of the form lo == 0 , Ip having the prop- 
erty of the theorem. When ^ = 0, the point y® is said to be normal, A necessary 
and sufficient condition for abnormality of order q is evidently that the matrix 
||^^y<(y®)|l have rank w — g. At a normal point y® the multipliers /o, Ip of the 
theorem can be divided by h and put into the form /o =* 1, h- In this form they 
are unique, since the non-vanishing difference of two such sets would be a set 
of multipliers implying abnormality. 

Lemma 76* 1. If a point y® is normal^ then for every set of constants rji 
(i s= 1 , . . . , n) satisfying the equations 
(761) 

there exists a set of functions yi(b) having continuous second derivatives near 
b = 0 , satisfying the equations <t>p = 0 , and such that 

yi(0) yUo) = . 

The proof can be made by considering the equations 
(76*2) ^(y)= 0 , 0Y(y) = 0y(y®) + 

(/3«1, i»; y^m + ly ..., n) , 

* For the arguments of Secs. 76 and 77 see Bliss [56]. 



THE MULTIPLIER RULE 


211 


in which the auxiliary functions ^(y) are selected so that they have continu- 
ous second derivatives near y® and make the functional determinant 1 0tyj^(y®) | 
different from zero, and in which the constants f-y are defined by the equa- 
tions 

(76-3) = 

Equations (76*2) then have solutions y,(6) with continuous derivatives of at 
least the second order near 6 = 0 and such that yi(0) = y9. By differentiating 
with respect to 6 equations (76 *2) with these solutions substituted, we find the 
equations 

=0 . 

<hyi (y‘) y'i (0) = ty • 

With equations (76 1) and (76 -3) these show that (0) = 

Theorem 76 -2. 7/ y“ is a normal point and f(y®) o minimum, then the condi- 
tion 

Pv,vk^y^) viVk'^ 0 


must hold for every set % satisfying equations {76' 1), where F = f -(- 1/90^ is the 
function formed with the unique set of multipliers lo = 1, 1/9 belonging to y®. 

The conclusion of the theorem is due to the fact that the function 
g(j) = /[y(6)], formed with the functions y,(6) of the lemma, must have a mini- 
mum at 6 = 0. Since 

<l>eyi[yib)]y[{b) =0 

the derivatives of g(6) are seen to have the values 

g'(J) -fy,[yW]yiib) =F,Jy( 6 )]y:( 6 ) 
g"(0) =Fy,y,(y^)ViVk; 

and for g(0) to be a minimum we must have g"(0) ^ 0. 

Theorem 76' 3. If a point y® has a set of multipliers lo = 1, 1^ for which the 
function F = f + satisfies the conditions 

(76-4) F,,(y®) =0 0 

for all sets ni satisfying the equations 

(76*5) = 0 

then f(y®) is a minimum. 

This can be proved with the help of Taylor’s formula with integral form 
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of remainder. For every point y near y** satisfying the equations ^ = 0 we 
have the equations 

/(y) -/(y") 

(76-6) 0 = ^.(y®) »J< + ^ (1 ~ ViVkdO , 

0 “ r<iHhn(y')mde , 

•'0 

where the sets y' and ij are defined by the equations y/ = y® + tf(yi — y®), i;< 
«= yj — y®. From these we find readily 

/(y) -/(y®) = Ai-fl)F,^*(y')u.Wfl • 

•/q 

Since the quadratic form in the integrand of the last integral, thought of 
as a function of independent variables y and is positive for y' = y® and all 
sets ri satisfying the equations (76*5), it will remain positive for y' = y® + 
0 ^y — ^ 0 ) anj all sets rij including i? = y — y®, satisfying the third set of equa- 
tions (76*6), provided that y lies in a sufficiently small neighborhood N of the 
point y®. Thus we see that for all points y in iV satisfying the equations 0^ = 0 
the difference /(y) — /(y®) is positive. 

Theorem 76 • 1 has the multiplier rule as its analogue in the calculus of varia- 
tions; an analogue of Lemma 76* 1 is Theorem 77 • 1 of the next section; and the- 
orems for the problem of Bolza corresponding to Theorems 76 *2 and 76 *3 are 
proved in Section 80 and Chapter IX. Theorems 76 *1 and 76 *2 give necessary 
conditions for a minimum at a normal point y®, and Theorem 76 *3 gives suffi- 
cient conditions. Since at a normal point the multipliers are unique, the sufficient 
conditions (76*4) are obtained from the necessary conditions merely by ex- 
cluding the equality sign in the condition of Theorem 76*2. Near a normal point 
the minimum problem is never trivial, since Lemma 76 *1 shows that near such 
a point there is always an infinity of other points y satisfying the equations 
0^(y) = 0. The normal points are the non-singular points of the manifold de- 
fined by these equations. 

Abnormal points y® are the singular points of this manifold. Near one of them 
the minimum problem may be completely trivial, as, for example, in the case 
of a point y® which makes » 0 and minimizes in the class of points y satis- 
fying ^ ^ — 0. A simple case is the point (yi, y 2 ) = (0, 0) for the 

problem of minimizing/(yi, y 2 ) in the class of points (yi, y 2 ) satisfying the equa- 
tion 0 = y! -f- yi = 0. 

An idea of the great variety of types of abnormal points may be gained by 
considering the problem of minimizing a function /(yi, y 2 ) in the class of points 
(yif y^) satisfying a single equation 0(yi, y 2 ) * 0, where 0 is arbitrary. It is then 
evident that the variety of abnormal points is at least as great as the variety 
of singular points of an algebraic curve. At an abnormal point the condition of 
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Theorem 76 • 2 is not necessary for a minimum for all sets of multipliers satisfy- 
ing the condition of Theorem 76 * 1, as is shown by the example / =* 2yf — y| = 
minimum, 4> = yfya — y* = and the minimizing point (0, 0). Theorem 76 -3 
does give sufficient conditions for a minimum at an abnormal point, but its con- 
clusions for the abnormal points to which it applies can readily be established 
by the consideration of normal points only, as is shown by Theorem 76-4 
below. In the statement of the theorem, y® is a point which has abnormality 
of order j, v is a variable which ranges over a subset of the numbers 
such that the matrix ||0i>y{(y)|| has rank m — and p is a variable ranging 
over the complementary subset. 

Theorem 76 *4. Let y® be a point which satisfies the hypotheses of Theorem 
76' 3 with a set of multipliers lo — 1, Ip, and let p and p be variables having the 
ranges described in the last paragraph. Then y® is normal for the modified prob- 
lem of minimizing the function g = f + lp4>p in the class of points y satisfying the 
reduced system of equations = 0, and y® satisfies the hypotheses of Theorem 
76 3 for the modified problem with the multipliers lo = 1, L. Furthermore^ if g(y®) 
is a minimum for the normal modified problem, then f(y®) is a minimum for the 
original one. 

We see that the point y® is normal for the modified problem, since the matrix 
l|0vy^(y®)l| has rank m — q. For the function F = g + ly<l>p = / + of the 
modified problem the conditions (76 *4) are satisfied for all sets q satisfying the 
equations 

(76-7) = 

since equations (76 *5) are linear and have a matrix of coefficients of rank 
m — q and hence are consequences of equations (76 *7). The set of points y 
satisfying the equations = 0 includes the points satisfying the complete 
system ^p = 0 as a subclass in which g = /. Hence, if g(y®) is a minimum for the 
modified problem, the value /(y®) = g(y®) must have the same property for 
the original problem. 

From the last theorem it is evident that generality is not lost by proving 
Theorem 76*3 only for points y® which are normal. Such points are, in fact, 
the non-singular points of the class which satisfy the equations 0p = 0, as we 
have seen. Theorem 76 • 4 has Theorem 92 • 2, below, as its analogue for the prob- 
lem of Bolza. 

77. Normality for the problem of Bolza. For the problem of Bolza the class 
ZR of arcs C of the form 

Viix) (i=l, n; Xi^x^X 2 ) 

is defined, as in Section 70, to be the class of admissible arcs C satisfying the 
differential equations ^p « 0 and the end-conditions ^ 0. A^>articular arc E 
of IR is said to have abnormality of order q if it satisfies the multiplier rule in 
Theorem 74*1 with q and only q linearly independent sets of multipliers of the 
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form /(hr =» 0, lfic{x) (a = 1, . . . , g). If g = 0, the arc E is said to be normal; 
otherwise it is abnormal. A set of multipliers /o, l^ix) with /o » 0 will be called an 
abnormal set of multipliers. If a normal arc E has a set of multipliers, then the 
set can always be made over into a set of the form k = 1, hix) by dividing by 
a suitable constant. A normal arc E can have at most one set of multipliers of 
the form /o = hM, since if there were two such sets their difference would 
also be a set of multipliers for E and it would be an abnormal set. 

Theorem n\.If anarcEof the set IK is normal^ then there exists an admissi-- 
ble one-parameter family of arcs of the form {7 12) in HI including the arc E for 
the parameter value b = 0 and including in every neighborhood of E arcs of HI not 
identical with E. 

This theorem shows that for a normal arc E the problem of proving that J{E) 
is or is not a minimum is never trivial, since there are always other arcs of IIT 
near E. A normal arc E in IR may be thought of as in a sense a non-singular arc 
of IR. Abnormal arcs E are singular arcs of IR. It is possible to construct exam- 
ples to show that near an abnormal arc of IR there may be no other arc of IR 
whatsoever. 

To prove the theorem one may make use of the first part of the proof of the 
multiplier rule in Section 74. The normal arc E may be imbedded in admissible 
{p + l)-parameter families of arcs (74 *2), and for such families the maximum 
rank attainable for the last p rows of the matrix (74 *4) must be p. Otherwise 
there would be a set of constants h = 0, e,,, not all zero, effective as in equation 
(74-5) and determining, as in Section 74, a non-vanishing set of multipliers 
/o = 0, Ifiipc) for E. This is impossible, since E is normal. Suppose, then, that the 
variations (74* 1) have been chosen so that the determinant of the first p col- 
umns of the last p rows of the matrix (74-4) is different from zero. Then the 
last p equations in (74 *3) have the form 

(771) b) =0 

if we now use b in place of ftp+i. They have the solution 6i = . . . = 6p = 6 = 0 at 
which their functional determinant with respect to 6i, . . . , bp is different from 
zero. From the usual implicit function theorems they therefore have solutions 
bp *= Bp{b) (/i = 1, . . . , with continuous derivatives near i = 0 and with 
initial values Em( 0) ~ 0. The admissible one-parameter family of arcs obtained 
from the family (74-2) by the substitution bp = Bp{b) contains the arc E for 
6 s 0 and consists of arcs in IR when b is sufficiently small. With the set (i,p+i, 
VhP+i(x) of Section 74 replaced by {i, ( 2 , ViM variations along E of 
the one-parameter family just determined are the functions 

(77-2) f2X(0)+{*, ViM(x)BU0)+v*M . 

If the last n of these are not all identically zero, the family will surely contain 
arcs not identical with E. But when the functions tjip have been chosen to secure 
rank p for the determinant of the matrix (74 *4) specified above, the variations 
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rii can always be selected linearly independent of them, so that the last n varia- 
tions (77-2) are not identically zero. This selection can be made by determining 
the functions corresponding to the variations by means of equations 
(72-2), selecting a set of functions ft(«) linearly independent of the p sets 
tiii{x) (m = 1, . . . , and choosing for the variations y^tix) solutions of the 
equations (72*2) with the functions ii{x) substituted. Thus the theorem is 
proved. 

Corollary 77 • 1. 7/ fi, {2, >?i(x) is a set of admissible variations satisfying the 
equations of variation <l>/j = = 0 along a normal arc E of the class lU, then the 

one-parameter family of arcs in HI imbedding E, described in Theorem 77' 1, can 
be so chosen that it has the set ^ 1 , ( 2 , ViM Us variations along £. 

The one-parameter family constructed for Theorem 77 1 will evidently nave 
the variations required in the corollary if the derivatives ^5^(0) in the variations 
(77 -2) all vanish. The equations (77 *1) with the functions b^ = substi- 
tuted have at 6 = 0 the derivatives 

1 ?) =0 (/X, cr=l, ..., p). 

The last term in each of these equations vanishes because the set fi, { 2 , 
satisfies the equations = 0, and the determinant | \ is different from 

zero. Hence all of the values B!, (0) are zero, and the corollary is proved. 

The one-parameter family of arcs in HI in the corollary was constructed out of 
a sequence of elementary families of the form (71 • 1). Each of these elementary 
families contains a sub-arc of E for values {x, b) satisfying conditions of the form 
x' ^ X ^ x*\ 6=0. The following corollary specifies some further properties 
of the elementary families which will be useful in Section 80, below. 

Corollary 77 • 2. Each of the sequence of elementary families {71 • 7), which to- 
gether form the one-parameter family of arcs in Xfl described in Theorem 77 '1 and 
Corollary 77 1, is defined by functions 

Jiixy b) {x'^x^x*\ |61<€) 

for which the derivatives y,b, y\h ^xist and are continuous in a neighborhood of the 
values (x, b) satisfying the conditions x' ^ x ^ x", b = 0. If the imbedded arc E is 
an extremal^ so that the functions yi(x) defining it have continuous second derivatives ^ 
then the derivatives y,bb, yi'bb, (yibb)' also exist at the values (x, b) satisfying 
x' ^ X g x", b = 0, and the last two derivatives are equal. On each elementary 
family the differentials appearing in equations (77 J) exist and satisfy these 
equations. 

The continuity of the functions y»6, yA is a consequence of the already es- 
tablished continuity near 6 = 0 of the solutions 5,* (6) of equations (77 - 1). The 
existence of the derivatives y^hb, yi'was stated would follow from the existence of 
the second derivatives 5^(0). To show that these second derivatives exist, we 
need the following lemma: 

Lemma 77-1. Let ^(x, b) be a function which with its derivative ^ is continuous 
in a neighborhood of the set of values (x, b) satisfying the conditions x' ^ x ^ x'', 
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b ss 0 and which has its derivative tpx continuous on this set itself. Let x(b) he a 
function having values on the interval and having a continuous derivative near 


b = 0. Then ath = 0 


diplx(b), b] 
db 




This is readily provable by applying the mean-value theorem to the two dif- 
ferences in the second member of the equation 

<plx(b), b] -iplxiO), 0] --iplxib), b] -ip[x(b), 0] 

+ <p[x{b), 0] -^[xiO), 0]. 

An application of the lemma to the function y< (x, b) of one of the elementary 
families is the formula, at 6 = 0, 

(77-3) dy[x(b), b] =y'V + y'. 

The equations for the first derivatives (b), found by differentiating the equa- 
tions (77 • 1) with the functions {b) substituted, are 

4^iipBp ( ft ) "f* yppb ~ 0 y 

where ^mp = drl/Jdbp, If the coefficients in these equations have derivatives 
at ft = 0, the same will be true of the solutions J5i(ft). But the existence of the 
derivatives at ft = 0 of the coefficients can be established by examining the 
functions which occur in them. It is found that all the functions involved are 
known to have derivatives at ft = 0 except y'[a;,(ft), ft], and the derivative of 
this is given by formula (77 -3). 

The existence of the derivative (yihY and its equality with (y0& can be es- 
tablished by an application of the well-known theorem that a function y) 
which with its derivatives ^y, tpxy is continuous near a point (x, y) has a de- 
rivative ifiys equal to ipxv at that point.® Similarly, by successive applications of 
the same theorem it can be shown that (yi»)' exists and is equal to {yi)vb- 
To complete the proof of Corollary 77 -2 it remains to prove the formulas 
(71*3) for differentials. This is simple except in the case of the last one, which 
depends again upon the formula (77 -3). We have, in fact, 

dy = y'djc + by , 

d^y ^ dy'dx + y'd^x+ by*dx + d^y 

=* {y”dx + 8y^)dx + y'd^x + by^dxA- b^y , 

and this reduces at once to the last formula (71*3). 

Besides the theorem and corollaries above, which show that a normal arc E 
of the class lit is always imbedded in one-parameter tamilies of arcs of lit, there 
are a number of other theorems concerning normality which haye useful appli- 
cations. 


* See, e.g., Pierpont, op. cit. 
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Theorem 77*2. The order of abnormality of an arc E of the class lit is always 
less than, or equal to, the numbers m and p, where m is the number of the equations 
(be = 0 and p is the number of etid-conditions = 0. 

For, suppose that /o<r = 0, lea{x) (<r = 1 , . . . , are the q linearly independent 
abnormal sets of multipliers with which the arc E satisfies the multiplier rule. 

q> m, there would be at least one linear combination Woa, whose 

elements would all vanish at xi. But in that case the elements of this set of 
multipliers would vanish identically, as indicated in the second paragraph fol- 
lowing Theorem 74 *1; and the q sets of multipliers could not be linearly inde- 
pendent, as supposed. 

If q>pt there would similarly be a linear combination kalotn W/j<r(x) for which 
an arbitrarily selected set of p of the coefficients of the 2» + 2 differentials 
dxi, dy ti, dx 2 , dyi 2 in the square bracket of (74 *9) would be zero. This set could be 
selected so that the determinant of the corresponding terms in the expression 
for dpn would be different from zero. Since the expression (74-9) is identically 
zero in the differentials dxi, dyn, dx^, dyt 2 , it follows that the constants e^ would 
all be zero and, consequently, that the remaining terms in the square bracket of 
(74*9) would also have coefficients equal to zero. But in that case the deriva- 
tives Fy\ would all vanish at Xi; and it follows that the linear combinations kJoa, 
kahc(x) would vanish identically, contrary to the hypothesis originally made on 
the abnormal sets of multipliers. We can see this since the functions Vi s 0 are 
the only ones which can vanish at Xi and satisfy equations (73 -8) with h = 0, 
and therefore the multipliers //^(x) = 0 are the only ones which can make the 
derivatives Fy\ vanish at Xi and satisfy equations (73 • 5) with Iq = 0. 

Theorem 77*3. If an admissible arc E is a solution of the first n equatiojis in 
(74 ’8) with an abnormal set of multipliers lo = 0, l^(x), then the condition 

(77*4) Fy*^ i = constant 

is true along the arc E for every set of admissible variations T;i(x) satisfying the 
equations of variation = 0 along E. 

The equation 

(77-5) = = 0 

is evidently true under the circumstances described in the theorem. With the 
help of the first n equations (74-8) it follows that the first member of (77 *4) is 
continuous everywhere on the interval xi 0 C 2 belonging to the arc E and has the 
first member of (77 • 5) as its derivative at all values x on XiX 2 not corresponding 
to the corners of the admissible arc E or discontinuities of the functions 17/. The 
conclusion of the theorem follows readily. 

Theorem 77 *4. necessary and sufficient condition for an admissible arc E 
to have abnormality of order q is that for admissible sets of variations £2#! 
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((TO 1, . . . , p) satisfying the equations of variation — 0 <Uong E the maximum 

rank attainable for the matrix 
(77-6) 17.) II 

w p — q. 

To prove this we first note, as in Section 74, that a set /o, with which 

the multiplier rule along the arc E is satisfied will have its elements all identi- 
cally zero if the elements of the set /oi or of the set /o, vanish simul- 
taneously. Consequently, the sets of a collection /o., • • • , s) 

will be linearly independent if, and only if, the sets /o., or the sets /o., €^9 are 
linearly independent. 

For each abnormal set U = 0, Cfg the constants are such that the 
equation 

(77*7) 17) =0 

holds for every admissible set fi, {2, Vi(x) for which the equations = 0 are 
satisfied along the arc JS. This follows from the identical vanishing of the ex- 
pression (74*6) and Theorem 77 *3 for such an abnormal set of multipliers. It 
was seen in the paragraph following Theorem 74- 1 that the identical vanish- 
ing of (74-6) in the variables {1, 17,1, fe, ^*2 is equivalent to the identical van- 
ishing of (74*9) in the differentials dxi, dyny dx^y dy^. 

The number q defined by the maximum rank p ^ qoi the matrix (77*6) is 
the number of linearly independent sets for which condition (77 *7) holds. 
With each such set there is associated an abnormal set h = 0, lfi{x) , with which 
the multiplier rule is satisfied, by the argument applied to (74 *5) in Section 74. 
Thus we see that there are at least q linearly independent abnormal sets of mul- 
tipliers /o = 0, Ifiipc) with which the arc E satisfies the multiplier rule. There 
cannot be more, since otherwise there would be more than q linearly independent 
sets Cn, for which the condition (77 *7) holds, which is impossible. Thus Theorem 
77 *4 is proved. 

The problem of Bolza with fixed end-points is the problem described in Sec- 
tion 69 when the number of independent end-conditions = 0 in (69*3) is 
2n + 2, the same as the number of end-values of the variables Xy yt. The arcs 
satisfying the end-conditions in this case, in particular those of the class HI, all 
have the same end-values at xi and X 2 - 

The multiplier rule of Theorem 74* 1 must, of course, be satisfied by a mini- 
mizing arc E for a problem with fixed end-points, just as for a problem with vari- 
able end-points. The transversality condition (74*9) can now, however, be 
omitted from the multiplier rule, since the constants Cy, are 2n -f- 2 in number 
and can always be determined so that the equation (74 *9) is an identity in the 
differentials dxiy dyny dx%y dyi%. The transversality condition therefore imposes no 
restriction upon a minimizing arc E when the end-points are fixed. 

An arc E is said to have normality y or abnormality of order on an interval 
xV' for a problem of Bolza if it has normality, or abnormality of order a, for the 
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corresponding problem with fixed end-points on that interval. From what has 
been said above, it is evident that an arc E has this property if and only if on 
the interval x'oc" it satisfies equations (74*8) with q and only q linearly inde- 
pendent abnormal sets of multipliers k = 0, l 0 {x). 

With this definition of normality on an interval in mind, the remarks in the 
introduction (Sec. 68) have more meaning. The distinction between normal and 
abnormal arcs for the problem of Lagrange was mentioned by A. Mayer in 
1886'° and was emphasized explicitly by Von Escherich in 1899." Since that time 
many proofs of fundamental theorems for the problem of Lagrange have de- 
pended upon the assumption that the arc E under consideration was normal on 
every subinterval or upon the slightly stronger assumption that E had an 
extension normal on every subinterval.'* As a result of papers by Bliss [17], 
Morse and Myers [24], Graves [29], and McShane [64], one can now prove, with- 
out any assumption whatsoever concerning normality, that the multiplier rule 
and the conditions of Weierstrass and Clebsch, to be proved in the next chapter, 
are necessary conditions on a minimizing arc. In his papers concerning suffi- 
cient conditions for a minimum, Morse [19, 23, 22] was the first to avoid the use 
of a hypothesis on an extension of the interval of the minimizing arc. Hestenes 
[38], Morse [41], and Reid [51] have proved sufficiency theorems applicable 
to arcs which are normal but not necessarily normal on every subinterval, and 
to abnormal arcs of a special type. 

10 “Begriindung der Lagrangeschen Multiplicatorenmethode in der Variationsrechnung,” 
Mathematische Annalenj XXVI (1886), 74. 

“ “Die Zweite Variation der einfachen Integrale (IV Mittheilung),” Sitzungshericktm der 
kaiserlichen Akademie der Wissenschaflen in Wien, CVIII (1899), 1290. 

“ See, e.g., Bolza, Vorlesungen uber Variationsrecknung, p. 603; Hahn, “Bemerkungen zur 
Variationsrechnung,” Mathematische Annalen, LVIII (1904), 152; Bliss [17], pp. 718, 735, 737. 



CHAPTER VIII 

FURTHER NECESSARY CONDITIONS FOR A MINIMUM 


78. The necessary conditions of Weierstrass and Clebsch. In this section 
the method of Graves is to be applied to prove the necessary condition of Weier- 
strass for a normal minimizing arc E for the problem of Bolza. Such an arc sat- 
isfies the multiplier rule with one and but one set of multipliers /o = 1, //jW, 
since the difference of two such sets would have the form h = 0, and be 

identically zero because E is normal. According to Theorem 77 *4, there is a set 
of admissible variations ^29 , “nicix) (or = 1, . . . , satisfying the equations 
of variation = 0 along E and giving the determinant j rja) | a value dif- 
ferent from zero. 

At an arbitrarily selected point 3 between corners on E let Y- (i = 1, . . . , w) 
be a set of values such that the element [xs, yixz), F'] is admissible and satisfies 
the equations 4>fi = 0. It is then possible to select the enlarged system of equa- 
tions (72 * 1) with the continuity properties described in Section 72 near the ele- 
ment [xzy y{xz)y F'] as well as near the arc £, and so that the functional determi- 
nant is different from zero at [xz, yixz), F'] as well as onJE. This will be 
proved in the next section. Equations (72 - 1) now define a set of functions Zy(x) 
associated with the functions yi(x) defining £, and a set of constants Zp to go 
with the set [xzj y{xz)y F']. Furthermore, the equations of variation (72*2) de- 
fine functions f 74 r(:r) (o- = corresponding to each of the sets of admissi- 

ble variations { 2 ®, vur^x) of the preceding paragraph. By the methods of Sec- 
tion 72 we may now infer the existence of three families of admissible arcs 

yiix, b) {xi — 8<x<xzy |6|<€), 

{78-1) YdXy b) ix,^x^x,+ ey \e\<€). 

yi(x, by e) (Xs + e ^ x^ X2+ 6 , |i|<€, \e\<€) 

satisfying on the first and third intervals the differential equations 
(78-2) y:^Xi[Xy y, 

from (72-3), and on the second interval the equations 
(78-3) y. 2 ), 

and having the initial values 

yiixiy b) +iir i7,r(ap,) 

Yiixzy b)^yi{xzy b), 

y»(«s+'«* by «)«F<(«,+ e, 6)- 
220 


(78-4) 
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The systems (78 *2) and (78 -3) are equivalent, respectively, to the systems 

0^ = 0, <l>y==Zy{x)+barjiff{x) 

= 0 , “ Zy , 

and F' is the value of (78-3) for the arguments (xs, y% Z). For values e> 0 the 
three families (78* 1), taken together, form a single admissible (p + l)-param- 
eter family of arcs consisting of a finite sequence of adjacent elementary fam- 
ilies. The functions defining these elementary families and their derivatives with 
respect to x have continuous partial derivatives with respect to the parameters 
by as one sees by the arguments used in proving Lemmas 72 1 and 72 • 2. They 
also have continuous derivatives with respect to the parameter e, since from the 
existence theorems for differential equations it is known that the solutions (72 *4) 
of the equations (72-3) have continuous derivatives with respect to jco, as well 
as the derivatives with respect to the variables yto, b mentioned in the text fol- 
lowing equations (72*4).' 

When b = e = 0 the first and third sets of functions (78 • 1) reduce to the func- 
tions yy(x) defining the arc E. For, like the sets y,(ic), they satisfy equations (78 • 5) 
with 6 = 0 and, from (78 *4), are continuous and have the initial values y*(jci). 
The solutions of equations (78 -5) having given initial values are unique. The 
variations along E with respect to 6<r of the first and third families (78 • 1) satisfy 
the equations of variation (72 • 2) with the functions ^wix) belonging to the vari- 
ations Furthermore, differentiation of the relations (78*4) shows that 

these variations form continuous solutions of equations (72 *2) with the initial 
values r;t<r(xi), and hence they must be identically equal to the solutions rjw{x). 
If we denote by the variations along the arc E of the first and third fam- 
ilies (78 - 1) with respect to the parameter Cy we find that they satisfy the equa- 
tions = 0 and the relations 

tjAx) =0 {xi - S <x^ Xz) 

(78-6) 

y',(xz) +v^ixz) = f ;. 

The latter are found by differentiating the first and third equations (78*4) with 
respect to e. They show that the functions 7fi(x) are, in general, discontinuous at 
the value x^; but this does not affect the validity of the arguments in the follow- 
ing paragraph. The notation fi, { 2 , Viix) will be used for the set = 0, {2 = 0, 
rii(x)- 

When the functions 

(78-7) xAb) ^x, + ba^^ (s^l, 2) 

are used to define end-points on the arcs (78*1) and the corresponding end- 
values of these arcs are substituted in the functions the latter become func- 


^ See Appen., Part II, esp. Sec. 6. 
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tions ^JJb , «) of the parameters b, e. At the values (b, e) = (0, 0) these functions 
have the derivatives 

n)- 

Since the minimizing arc E satisfies the conditions 4/^ = 0, the equations 
(78*8) Mb, e) =0 

have the initial solution (J, e) = (0, 0) at which the functional determinant 

,.)i 

is different from zero. Hence equations (78 -8) have solutions = B„{e) which 
vanish at e = 0 and have continuous derivatives near that value. At e = 0 these 
derivatives satisfy the equations 
(78-9) M., rj) =0 . 

When the substitution ba = Ba(e) is made in the functions (78 • 1) and (78 • 7), a 
one-parameter family of arcs is determined which contains the minimizing 
arc E for the parameter value e = 0. The arcs of the family are admissible for 
sufficiently small positive values of e and satisfy the conditions = 0. 

They are thus in the class ITl. 

When the functions (78* 1) and (78 *7) are substituted in the sum /, it be- 
comes a function of the variables b, e whose value is 

Jib, e)^g{xAb), ylxdb), b], x^ib), ylx 2 ib), b, e]] 

+ y(x, b), y'ix, b)]dx+ f * fix, Y, Y')dx 

f[x, yix, b, e), y'ix, b, e)]dx . 

With the help of the equations = 0, the multiplier rule of Theorem 74 • 1 with 
the transversality condition in the form of the identity (74 *6), the usual integra- 
tion by parts, and equations (78 -6), it is provable that at the values (6, e) = 
(0, 0) the derivatives of / have the values defined by the equations 

f »» ^ I 

(7810) 

n) = [£(*. y, y', I F')]*, 

where 

E{x, y, y\ I, Y') =F(x, y, Y', 1) -F{x, y, y', 1) 

- y, y', 1) 

is the well-known Weierstrass £-function. 
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For small negative values of e the arcs defined by the functions (78 • 1) are not 
admissible. Each of thern has, in fact, three points on each ordinate of the initT- 
yal Xi + e < X < xz. For small positive values of e, on the other hand, they 
are admissible; and the same is true of the arcs of the one-parameter family de- 
fined by substituting in the functions (78*1) and the end-values 

(78 *7). If J{E) is to be a minimum, the values of the sum J on the arcs of this 
one-parameter family must be non-decreasing as e increases from the value zero. 
The derivative at e = 0 of the function J[B(u), u\ is 

/6,(0, 0)5; (0) +7.(0, 0). 

It must be non-negative. From (78*10) and (78*9) the value of this derivative 
is found to be the value of E{x^ y, y', /, F') at the point 3, and the following the- 
orem is proved : 

Theorem 78* 1. The Necessary Condition of Weierstrass. An admissi- 
ble arc E satisfying the equations <t>fi = 0 and the multiplier rule^ with multipliers 
lo = 1, l^(x), is said to satisfy the necessary condition of Weierstrass with these 
multipliers if the condition 

(78*11) E{x, y, y', /, F') ^ 0 

is valid at every element (x, y, y', 1) ofE for all admissible sets (x, y, F') 7^ (x, y, y') 
satisfying the equations = 0. Every normal minimizing arc E for the problem of 
Bolza must satisfy this condition. 

The necessity of the condition for a minimizing arc E has been proved above 
only at elements {x, y, y', 1) between corners of E, But it is evident by simple 
continuity considerations that for a minimizing arc the condition (78 * 11) must 
also hold at corner elements. 

It should be noted here that the proof in the text of the necessary condition 
of Weierstrass is for a normal arc. Quite recently McShane [60] has proved that 
for a minimizing arc £, whether normal or not, there always exists a set of multi- 
pliers /o, hix) with which E satisfies both the multiplier rule and the necessary 
conditions of Weierstrass and Clebsch. 

Let Ti (f = 1, . . . , n) be a set of values satisfying the equations 

(78*12) = 0 

at an element {x, y, y', /) of a normal minimizing arc E. By means of the equa- 
tions 
(78*13) 

these numbers ir,* define n — m further quantities Ky, The equations 
y, p) = 0, <^(x, y, p) = Zy+eKy 

have the initial solution (c, p) = (0, y') and determine a set of solutions pi^e) 
with initial values pi{0) = y/, and with derivatives ^<(0) = t*, since these de- 
rivatives satisfy the equations (78*12) and (78*13) when inserted in place of 
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the numbers n. The sets [x, y, p{^] are all interior to Ri for sufficiently small 
values of c and, according to the last theorem, must satisfy the condition 
E[x, y, y', /, ^(e)] ^0 . 

By simple calculations one readily verifies, with the help of the definition of the 
E-function following (78 *10), that the first member of this last inequality and 
its first derivative with respect to € both vanish at e = 0. Its second deriva- 
tive at € = 0 is the quadratic form Fy\y^,viirhy and this must not be negative. 
Consequently, we have the following theorem: 

Theorem 78*2. The Necessary Condition of Clebsch. An admissible 
arc E satisfying the equations = 0 and the multiplier rule with multipliers lo= 1, 
lfl{x) is said to satisfy the necessary condition of Clebsch with these multipliers if 
the condition 

y* y t 1) 0 

is valid at every element (x, y, y', 1) of E for all sets {vi, . . . , O 5 *^ (0> • • • , 0) 
satisfying the equations 

yy 

Every normal minimizing arc for the problem of Bolza must satisfy this condition. 

79. A lemma and corollary. In Sections 72 and 78 frequent use has been 
made of an enlarged system of equations = 0 , 07 = Zy for which the func- 
tional determinant \<t>iyk\ is different from zero. The existence of such an exten- 
sion of the system 0 ^ = 0 can doubtless be proved in many ways, but in par- 
ticular with the help of the following lemma and its corollary: 

Lemma 79* 1, Let ||a^(x)|| (k = 1, . . . , n; jS = 1, . , . , m < n) fte a matrix 
of functions with well-defined forward and backward limits at each point of an in- 
terval Xi ^ X ^ X 2 and continuous except at a finite number of points of this inter- 
val. If the matrix has rank m at every point of the interval xiX2, then there exists an 
enlarged matrix ||aik(x) 1| (i, k = 1 , . . . , n), with each aykCx) (7 = m + 1 , . . . , n; 
k 1 , . . . , n) a polynomial, and such that the determinant |aik(x) | is different from 
zero on X1X2. 

To prove this it will first be shown that there exists one row of continuous 
functions ak{iz) such that the matrix 

I au{x) II 

is of rank m + 1 on XiX%. Evidently there exists such a row of functions ak{x) 
on an interval xi^x^x^ zo short that on it one of the determinants of 
order m of the matrix ||a 0 k(x)|| remains different from zero. Let { be the least 
upper bound on xiXi for the end-values x' of such intervals, and let X and u range 
over two sets of m values each for which the determinants 

laffy(( + 0)l 
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are different from zero. Then for a sufficiently small € 

\a0K{x)\9^O (t-c<x<{), 

\a 0 Ax) I 5^0 a<x<i+^). 

Let Xix' be an interval with f — € < ac' < { for which the functions ak{x) exist, 
and let j be a value not on the range of X. For all values i 9 ^ j on the range 
1 , . . . , » the definitions of the functions a^ix) may be extended so that 

ai(x) =ai(x') (x'< t)» 


and the definition of aj(x) may be extended continuously on the same interval 
so that 


m(x) 

ax(x) 


a0j(x) 

aj(x) 


5^0 


(x'<xg (). 


If the subscript^* is next selected to be a value not on the range of the subscript/n, 
then by a similar process the definitions of the functions ak(x) may be extended 
continuously over the interval ( < x g f + c; and it is evident that f cannot 
be the upper bound described unless it is at X 2 . By continuing this process, addi- 
tional rows may be added until a matrix ||aifc(jc)|| is reached whose determinant 
is different from zero. If the continuous functions ayk(x) are replaced by poly- 
nomials approximating them sufficiently closely, the result stated in the lemma 
is attained. 

Corollary 79 - 1. The polynomials a^kCx) of the lemma can be so chosen that at 
a finite number of points { on the interval X 1 X 2 , each having associated with it a 
matrix of constants ||b^kll of rank m, the inequalities 


(791) 




5*^0 


(18 = 1, 


m\ 7 = m + 1 , 


. . n; jfe = 1 , . . . , n) 


are satisfied. 

Let X range over a set of m values such that the determinant | | is different 

from zero, and lety be a value not on the range of X, The definition of 0 ^+ 1 , Xz:) 
can be changed slightly near ( if necessary, so that it remains continuous, 
makes 


bfix 

®m+i.x({) 


hi 

flm+l./({) 


^ 0 , 


and leaves the determinant | aik{x) | different from zero on xix^- The same process 
can be repeated for the other points $ so that the matrices (79 *1) all have rank 
m+ 1, Similar modifications for succeeding rows and the subsequent approxi- 
mation to the continuous functions Oykix) by polynomials lead to a matrix 
lk*fc(*)|| with the properties described in the corollary. 

In Older to construct the functions ^{x, y, y') of equations (72*1), let the 
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functions a^kix) of Lemma 79 * 1 be the values of the derivatives 0 / 9 ^; along the 
arc E. The functions 

(79-2) 4>y{x, y, y) ^ aykix)^^^, 

formed with the functions Oykix) of the lemma, then have the properties de- 
scribed in Section 72. The constants y{xzi, F'(a; 8 )] in the second para- 

graph of Section 78 may be denoted by and the value Xz by The functions 
(79*2) formed with the functions Oykix) of Corollary 79 1 satisfy the require- 
ments for equations (72 • 1) prescribed in Section 72. 

80. The second variation and a fourth necessary condition for a minimum. 
In this section we shall consider an arc E of the class HI which is an extremal and 
satisfies the multiplier rule of Theorem 74* 1 with a set of multipliers of the form 
h — 1) h{x). We suppose, furthermore, that E is defined, as usual, by functions 
of the form 

(80*1) yi{x) (t=l, ..., n\ Xi^x^xz) 

and that E is imbedded in a one-parameter admissible family of arcs in HI, 
(80-2) yi^yi^x, b) [xAb)^x^xAb); | 6 |<€], 

having the continuity properties of Corollary 77 *2 which are sufficient to justify 
the deduction of the second variation in the following paragraphs. The variations 
of the family (80-2) along the arc E will, as before, be denoted by fi, { 2 , 

If the arc E is a normal non-singular minimizing arc without corners, it will 
certainly have the properties just described, as we shall see in the paragraph 
following Theorem 80 - 1. But the second variation turns out to be well defined 
on every extremal arc E of the class HI with multipliers of the form /o = 1, h(x ) . 

The value of the sum J on the arc (80 *2) is a function J(b) with a first differ- 
ential (73 - 1), which can easily be put into the more convenient form 

dJ~dg + e^d,^^+ lFdx]l+f\Fy, hyi+Fy'. iyi)dx 

by adding to (73* 1) terms which vanish. This formula holds along every arc of 
the family (80*2). Along the particular arc £ where 6 — 0 the usual integration 
by parts shows that the value of dJ is zero, since E satisfies the multiplier rule 
in Theorem 74* 1. 

With the help of Lemma 77 * 1 applied to the function 

6 ), /(^, 6 ), /(*)], 

thought of as a function of x and 6 , the second differential dfj =* J"( 0 )^*> taken 
along the arc £, has the value 

dV - [F dH +E' dx^ -f 2F^. hyi dx -f 2E,,'. hy\ dx]\ + d^g + 

+fJ'[Fy^S*yi+Fy'^i*y[+2i^{x, iy, iy')]dx. 
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where 2w is the quadratic form 

2«(*, rtj ijO + . 


The symbol F' stands for the derivative of F with respect to the x which occurs 
in all of its arguments. After calculating this, applying the usual integration by 
parts to the first two terms under the integral sign in the expression for d®/, and 
eliminating the ^-differentials outside the integral sign by means of the equa- 
tions (71*3), it turns out that 

dU = [F d^x + (d^y. - y\ dH) F.«.] ? + 

+ [ (Fx - y'JFy^) dx^ + 2F.. dyi dx] ? + /**2w dx . 


In this expression the coefficients of the terms in d^Xi, (Pyn, d^xt, tPya are zero, 
on account of the transversality condition (74-9); and a final value for d*/ is 
therefore 


dV = [ (Fx - y'^y.) dz» + 2F,. dy.- dx]\ + 2q-\- e^lq^ 


(80-3) 


+ f '2<a{x, by, by')dx , 


where 2q is the quadratic form in djci, dyny dxz, dyi 2 whose coefficients are the 
second derivatives of g and where 2q,i is the similar form for the function In 
terms of the variations 


(80-4) {i = a;((0), = (x) = 0) 


the last equation is equivalent to the equation 


with 


d2/=/2(f, 


v) =27l{i, riiixi), 

(80-5) 


$ 2 , 

+ f *2«(x, Tj, iy') dx , 


where 27[dxi, dyn, dx 2 , 5y*2] stands for the terms outside the integral sign in 
(80*3) with the differentials dxi, dy»i, dx 2 , dy,2 expressed in terms of dxi, Byn, 
dx 2 f Syi 2 by means of the equations (71*3). 

The expression /2(f, v) is called the second variation of the sum J along the 
arc £, It is well defined for admissible sets {i, {2 , t?»(x) along every extremal arc 
with multipliers of the form /q = 1, /^(x) . It is said to be non-negative if it is posi- 
tive or zero for every admissible set of variations {1, {2, ViM satisfying along E 
the conditions = 0. It is positive definite if it is non-negative and van- 

ishes only when the elements of the set fi, {2, i7t(x) vanish identically. With the 
help of the results of the preceding paragraphs the following fourth necessary 
condition for a minimum can now be easily justified: 
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Theorebc 80 * 1. a Fourth Necessary Condition for a Minimum. An ex- 
tremal arc E with multipliers lo == 1, l/ 9 (x) is said to satisfy the fourth necessary 
condition for a minimum if the second variation J 2 ({, v) along E is non-negative. 
Every normal non-singular minimizing arc £ without corners for the problem of 
Bolza must satisfy this fourth necessary condition. 

To prove this, we note first that a minimizing arc E must satisfy the multi- 
plier rule of Theorem 74 * 1. If E is normal, its multipliers can be taken in the 
form /o \ and in that form they are unique, by the remark in the first 

paragraph of Section 77. If E is non-singular and has no corners, it is an extrem- 
al, according to Corollary 74*3 and the definition of an extremal in Section 75. 
Thus with the help of Theorem 77 *1 and Corollaries 77 *1 and 77 *2 it is seen 
that a minimizing arc E of the type described in the theorem can be imbedded 
in a one-parameter family of arcs in HI of the form (80 -2). Since £ is a minimiz- 
ing arc, the function /( 6 ) described above for this family must have a minimum 
/(O) and must have 

/'(O) =0, /"(O) =/ 2 ({, 17 )^ 0 . 

The first derivative vanishes, as has already been proved, and the second de- 
rivative must be non-negative for every set of admissible variations fi, { 2 , r\i(x) 
satisfying the conditions = 0 along £, since every such set is the set of 

variations along E of an imbedding family of the form (80*2), according to 
Theorem 77 • 1 and its corollaries. Thus the proof of Theorem 80* 1 is complete. 

81. The accessory minimum problem. The fact that the second variation 
must be iK)sitive or zero along a normal non-singular minimizing arc E without 
comers, as stated in Theorem 80*1, suggests, as usual, an auxiliary minimum 
problem called the accessory minimum problem. This is the problem of finding, in 
the class of admissible sets of variations { 1 , { 2 , rii{x) satisfying the equations of 
variation = 0 , 'J',, = 0 along £, one which minimizes the second variation 
for -E in equation (80*5). That this problem is equivalent, with slight 
changes, to one of precisely the Bolza type will be seen in the proof of Lemma 
81*3. There is an accessory minimum problem on every extremal arc E with 
multipliers of the form /o = 1 , since on every such arc the second varia- 
tion (80*5) is well defined. We shall assume in this section that the extremal 
arcs E considered have multipliers of this form and are non-singular unless 
otherwise expressly stated. 

The equations of the extremals for an accessory minimum problem, analogous 
to equations (74*11) and — 0, are the equations 

d^\ 

( 8 M) = o ^^ = 0 

dx 

where Q, defined by the equation 

(81*2) 17, ri % X) + 9 
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is seen to be a homogeneous quadratic form in the variables 17,-, i7< , The equa- 
tions (81 * 1) are called accessory differential equations? Canonical variables x, 
y\iy may be defined for them in terms of Xy rn, rjl, by means of the equations 

(81*3) ft 1?, ri'y \)y O^^fiixy 17, 17'), 


whose second members are linear in i7», ril, \fi and have the determinant 






M'i 


0 


Mi 

0 


of coefficients of 17/, X/j. Since the arc E is, by hypothesis, non-singular, this de- 
terminant is everywhere different from zero; and the linear equations (81*3) 
have solutions 

(81-4) i7'=ri.(ac, 17, f), \0 = AfiiXy 17, f) 

linear and homogeneous in the variables ?7i, f The Hamiltonian function ff for 
the second variation, analogous to the function (75 - 11), is defined by the for- 
mula 

fly f) = 

= f,ni-s2(x, 17, n, A). 

It is a homogeneous quadratic form in the variables rjiy f*- with coefficients 
which are functions of x having continuous derivatives on the interval X1X2 on 
which the arc E is defined. By an argument similar to that of Section 75 it fol- 
lows that equations (81 • 1) are equivalent to the canonical accessory equations 

(815) 

analogous to equations (75 *13). 

The systems (81*1) and (81*5) are, in a sense, the equations of variation 
along E of the Euler-Lagrange equations 

dF •. 

(81-6) — = 0 = 0 
dx 

and the canonical equations (75*13) of the original problem, respectively. 
Equations (81 *6) are, in fact, satisfied identically in x and the constants a,-, i»* 
by the functions (75*10) defining the family of extremals in which the non- 
singular arc E is imbedded for values xi"^ x ^ X2y flto, ito- When these functions 
are substituted in (81*6) and the result is differentiated with respect to one of 
the constants a of the set a,-, 5,*, it is foimd that the functions 17* = ytjx, ao, 60), 

• Von Escherich, “Die zweite Variation der einfachen Integrale (I. Mittheilung),” Sitzungs- 
hmchten der kaiserlichen Akademie der Wissenschaften in Wien, CVII (1898), 1236; Bolza, 
Vorlesungen fiber Variationsrechnung, p. 622. 
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\fi » oo, bo) satisfy the equations (8M). Similar differentiations of the 
equations 

*•(«, a, h) ^Fy».[x, y(x, a, 6 ), y'(x, o, b), l(Xy o, b)] 
y(x, a, b) , y'(x, a, i)] 

show that the derivatives f ^ = s,® are related to the derivatives iji = yia, = l0a 
along E by means of equations (81*3) and, hence, that r/i = y<a, f< = 2* satisfy 
the canonical accessory equations (81*5). Thus the columns of the matrix 
(75 • 9) are 2n independent solutions of the equations (81 • 5) . Since they are also 
solutions of equations (75*14), it follows readily that the systems (75*14) and 
(81*5) are identical and that 

(81*7) 2$ ^ • 

One verifies by means of the properties of the family of extremals (75 *8) and the 
remark following equations (75 • 12) that the coefficients of the quadratic form ^ 
calculated for the arc E are functions of x with continuous derivatives of at 
least the first order in a neighborhood of the interval XiX2» 

By a solution of the equations (81*5) is meant a pair of functions i?»(x), ft(:c) 
having continuous derivatives on a neighborhood of the interval XiX2 and satis- 
fying the equations. By differentiating equations (81*5) it follows that such a 
solution will also have continuous second derivatives. The existence theorems for 
differential equations tell us that through each initial element (xo, rjiOf f <o) with xo 
on X1X2 the equations have a unique solution rjt{x)y f»(ic), and, in particular, 
that a solution rji, f whose elements all vanish at a point xo is identically zero. 
Every solution rn, f » is linearly expressible with constant coefficients in terms 
of a fundamental system consisting of an arbitrarily selected set of 2n linearly 
independent solutions. The 2n columns of the determinant (75 *9) form such a 
fundamental system. 

A set fi, {2, is said to satisfy the accessory multiplier rule if there exists a 
set of functions i‘t(x) and constants e® such that the set 77i, f » is a solution of the 
canonical accessory equations and such that the equation 

(81*8) [f< rfij*]!+<i7+eMrf'*V = 0 

is an identity in the differentials ^77,2. The form of this multiplier 

rule is suggested by the multiplier rule for the problem of Bolza used in the proof 
of Lemma 81*3. The accessory minimum problem is said to have abnormality of 
order q if there exist q linearly independent sets of constants and solutions of the 
canonical accessory equations of the form 17,- s 0, f <(x), for each of which the 
equation 

(81-9) = 0 

is an identity in dim, d^i, dria. By an argument similar to that in the para- 
graphs just preceding Corollary 74 - 1 it follows that the q sets f<(*) and the q 
sets fy are separately linearly independent. 



FURTHER NECESSARY CONDITIONS FOR A MINIMUM 231 


Lemma 81*1. The original problem of Bolza and the accessory minimum prob- 
lem have the same order ^ of abnormality. 

According to the definition of abnormality in Section 77, above, there will 
exist exactly q linearly independent abnormal sets of multipliers of the form 
/o = 0, Ifi(x), Cft for which the multiplier rule in Theorem 74 * 1 is satisfied, pro- 
vided that the arc E has abnormality of order q. It is seen, upon examination, 
that for one of these sets the equations (74 *8) and (74 *9) imply the equations 

(8M0) 0,;= rQvidx + a, 4>/j = 0, 

(81*11) [Q,; (dyi -y'.dx)]l+ dp^ = 0 

where the arguments of the derivatives of Q and are x, 17, • = 0, = Ip. Near 

every value x there is a set of m of the equations (81 • 10) which can be solved 
for the multipliers Ifi occurring in their first members, from which it follows that 
these multipliers are continuous and have continuous first derivatives, since the 
arc E is an extremal and therefore has no corners and has continuous second 
derivatives. Equations (81 • 10) are now seen to be equivalent to (81 • 1) with 
s 0, = Ifi, and the identity (81*11) in dxi, dy^x, dxi, dyi 2 is equivalent to 

the identity (81-9) in fi, i7»i, fei ^t2. Consequently, every set of q linearly inde- 
pendent multipliers h = 0, Ifi, e,* satisfying along E the equations (74-8) and 
(74 *9) defines, by means of equations 

(81*12) i7.* = 0, 0, 1) y, y\ 1), 

q linearly independent sets 17* s 0, satisfying the conditions (81-5) and 

(81*9), and conversely. The original and the accessory problems, therefore, 
have the same order of abnormality. 

Lemma 81 • 2. The accessory minimum problem is equivalent to a normal prob- 
lem of similar type obtained by deleting a suitably selected subset of its end-condi- 
tions. 

If the accessory minimum problem has order of abnormality equal to q, there 
exist q linearly independent sets i7wr = 0, f,v(x), e^,r (cr = 1, . . . , q) satisfying 
the canonical accessory equations and making the equations 

(81*13) [ti.riiVi+fy.%^0 (c7 = l, ..., q) 

identities in fi, riiu Vih as one sees with the help of equation (81 • 9) . Since the q 
sets are linearly independent, the matrix \\€fu,\\ has a determinant \€pa\ of 
order q which is different from zero. Consequently, equations (8113) are equiva- 
lent to identities of the form 

(8114) •9,’“ + 

where the range of v is complementary to the range of p on the set p = \,,,.,p. 
From Theorem 77*3, equations (81 * 12) showing that and Fy\ are equal, and 
the last equations it follows that every set of admissible variations fi, 97»(x) 
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which satisfies the equations ^0 = 0^ = 0 will also satisfy the equations 

^0 = 0, = 0, and conversely. Hence the accessory minimum problem with 

the reduced system of equations = 0 , = 0 is equivalent to the original 

one. The reduced problem is normal, since a non-vanishing set 17 * = 0, €„ 

satisfying the canonical accessory equations and making the equation 

[ft rii]\+ep^p = 0 

an identity, would be a part of a similar set rn = 0 , fi(x), €„, €p = 0 making 
(81-9) an identity. This last set would then be linearly expressible in terms of 
the sets 17 , v s 0 , f »>, €»#, €pcr, which is evidently impossible, since the determinant 
I Cpp| is different from zero. 

Corollary 81 • 1. If the matrix of coefficients of and ^2 in the func- 

Hons is of rank two, then the corresponding matrix for the reduced problem has 
the same property. 

This is a consequence of the fact that the system = 0 and the system 

= 0, [fta» 7 i]i = 0 are equivalent by means of the relation (81 • 14). If one of 
them has no non-vanishing solution fi, i 7 »i = 0 , ^ 2 , = 0 , the same must be 

true of the other. 

Lemma 81-3. Every minimizing set fi, { 2 , ^i(x) for the accessory minimum 
problem must satisfy the accessory multiplier rule with at least one set fi(x), €p. 

To prove this, one may replace the accessory minimum problem by one 
which is precisely of the Bolza type. An admissible set ^lix), ^2{x)y rii{x) may 
be defined as one determining in the space of points (Xy fi, { 2 , Vt) a continuous 
curve consisting of a finite number of arcs with continuous derivatives. The 
problem of finding, in the class of admissible sets of this sort satisfying the con- 
ditions 

^^ = 0, {( = 0, f' = 0, 

viXl), IjC**)] =0 

*l — *10 = 0 *2 — *20 = 0 , 

one which minimizes the sum 

/ X, 

2 w(*, ij, v')dx 

is an auxiliary problem of Bolza equivalent to the original accessory problem. 
When the multiplier rule of Theorem 74* 1 is applied, it is found that every ad- 
missible arc for the auxiliary problem is normal and non-singular, since the re- 
duced accessory problem of Lemma 81 *2 is normal and since the arc E is non- 
singular, The equations analogous to (74-8) and (74*9) for a minimizing arc 
of the auxiliary problem imply the multiplier rule for a minimizing arc of the ac- 
cessory minimum problem. 

Attention should be called, at this point, to the notions of conjugate solutions 
and conjugate systems of solutions of the canonical accessoiy equations. For an 
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arbitrarily selected pair of solutions, iy<, f * and 5*, f „ of these equations the ex- 
pression rii f » — a constant, as one sees by differentiating this expression, 

applying the equations (81*5), and using a simple property of quadratic forms. 
If this constant is zero, the two solutions are said to be conjugate solutions, A 
conjugate system of solutions of the canonical accessory equations is a system of n 
linearly independent solutions f every pair of which are con- 

jugate. If a solution rn, f * is conjugate to every one of a system of conjugate solu- 
tions, it is linearly expressible in terms of the solutions of the system. The initial 
values rji(xi)y ^i{xi) of such a solution are, in fact, solutions at x = Xi of the n 
linearly independent equations 

Vtkti ~ ^ (^ = 1, ••., ft) • 

The n linearly independent sets faixi) (j— 1, . . . , w) are, however, also 

solutions, since the system ??iy, f*/ is a conjugate system. Hence, a.tx = xi there 
are relations of the form 

which also hold at every value of x, since a solution J*,* — of the 

canonical accessory equations with elements vanishing at one point vanishes 
identically. Thus the maximum number of linearly independent conjugate solu- 
tions is n. 

It is easy to construct special cases of conjugate systems of solutions of the 
accessoiy equations. A necessary and sufficient condition that a system >;,y, f ,y 
form a conjugate system is that its matrix ||i4iy 5iy|| of initial values at :r = xq, for 
example, shall be of rank n and have the property that the product matrix 
whose elements are the sums A ijBik shall be symmetric. The conditions for this 
symmetry are the equations 

A ijBik *“ A ikBij = 0 (^, ^ = 1, - ••, w) , 

and these are the conditions for conjugacy of the pairs of solutions i/ty, Ui and 
i7»ibi ft*. Two types of conjugate systems which are sometimes useful are those 
which have matrices of initial values of the form ||5,y Ojl or ||0 5»y|l at a point 
X = rro, where \\Bij\\ is the identity matrix. Evidently, for an arbitrarily selected 
value Xo there exists a conjugate system rnk, f > «) whose determi- 

nant 1 17 a (^ 0 ) I is different from zero. 

The following lemma will be useful in Section 90, below. 

Lemma 81*4. If the end-values xi and X 2 for a non-singular extremal arc E are 
not conjugate^ then for es)ery admissible set of variations 7ji{x) satisfying the equa- 
tions of variation = 0 along E there exists an accessory extremal Ui(x), Vi(x) 
sisch that Ui(xi) « i 7 i(xi), Ui(x 2 ) = riii.^ 2 ). 

The end-values Xi and X 2 of E are said to be conjugate if there is an accessory 
extremal for E with elements vanishing at xi and X 2 but not identically zero be- 
tween xi and Xi, Otherwise xi and X 2 are said to be non-conjugate. 
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If the order of abnormality of the arc E on the interval XiXt for the problem 
with fixed end-points is equal to g, there are g and only g linearly independent 
sets of multipliers lo, = 0, ^(x) (<r == l,...,g) with which E satisfies equations 
of the form (74 - 8 ), according to the definition of the order of abnormality for 
fixed end-points in Section 77. If F, = then the equations 

(8115) [Fv,i7.1? = 0 


are satisfied by every admissible set of variations 7n(x) satisfying the equations 
4>, = 0 , as stated in Theorem 77 *3. Equations (81-15) are linear in the end- 
values ViMf ViM and, furthermore, are independent. To see this, one can first 
readily verify the fact that the equations (74*8) with abnormal multipliers 
lor = 0 , Ifia are equivalent to equations (81 • 10 ) with arguments riu, s 0 , 
in the derivatives of Q; and this in turn implies that the functions rjig s 0 , 
= {Xf 0 , 0 , 1) are solutions of the canonical accessory equations. 
Linear dependence of the equations (81 • IS) would imply conditions of the form 
Cativixi) = 0 , and these would in turn imply that the solution of the 

canonical accessory equations has all of its functions zero at Xi and hence is 
identically zero. Since the rank of the matrix is everywhere equal to m, 

the identities = cj$ 9 <l>fiyi ^ 0 would imply that cjfiir = 0 , which contra- 
dicts the linear independence of the multipliers. Equations (81 • IS) are therefore 
independent. 

It can now be verified further that, for a non-singular extremal arc E with 
ends not conjugate and order of abnormality equal to q with respect to the fixed 
end-point problem, the rank of a matrix 


( 8 M 6 ) 


UuiXi) 

UiAX2) 


formed with a fundamental system «i.(a:), vu(x) (j = 1 , . . . , 2 ») of accessory 
extremals belonging to £, is always 2n — q. The rank cannot be more than 
2n — 9 , since the columns of the matrix satisfy the q independent equations 
(81 • 15). If it were less than 2n — 9 , there would be more than 9 linearly inde- 
pendent accessory extremals m = uuc,, Vi » Vuc, with functions Ui vanishing 
at Xi and X 2 . Since the order of abnormality of the accessory problem along E 
is 9 , at least one of these would have its functions Ui not identically zero between 
Xi and X 2 i and this is impossible when the ends of E are not conjugate. 

The conclusion of the lemma now follows easily. Every admissible set of 
functions fii(x) satisfying the equations - 0 has end-values ruixi), vii{x 2 ) sat- 
isfying the equations (81 * IS). The colunms of the matrix (81 - 16) have among 
them 2n — 9 linearly independent solutions of equations (81 * IS) in terms of 
which all other solutions are linearly expressible. In particular, we must have 
rii{xi) « c»uu{xi)f riiixt) = C9Uu{x2) ; and the functions Ui(x) of the accessory ex- 
tremal Ui » Vi » c^Vi, therefore have the same end-values as the functions 



CHAPTER IX 

SUFFICIENT CONDITIONS FOR A MINIMUM 

82. Statement of the sufficiency theorem. The proof of the sufficiency 
theorem to be given in this chapter is an outgrowth of arguments originally made 
by Hestenes [38] and depends essentially upon methods which he devised. His 
proofs have the great advantage that they establish sufficient conditions for a 
normal arc E of the class III to be a minimizing arc, even if E is not normal on 
every subinterval, as was assumed by earlier writers.^ The method of Hestenes 
is applicable also to some abnormal cases the proofs for which can be made to in- 
volve only normal ones, as we shall see in Section 92. 

In order to state the sufficiency theorem later to be proved, it is necessary 
first to define the four conditions I, Ilir, III', IV' which are the basis of the 
sufficiency proof. Conditions I, II, III, IV on an arc E of the class HT are, re 
spectively, the multiplier rule of Theorem 74 *1, the condition of Weierstrass of 
Theorem 78 • 1, the condition of Clebsch of Theorem 78 • 2, and the condition IV 
of Theorem 80 - 1. It has been shown that all of these are necessary conditions 
for a minimum on a normal non-singular minimizing arc E without corners. 
An arc E is said to satisfy the sirengihened condition of Weierstrass if the 
inequality 

E{x, y, /, /, F') >0 

is satisfied for every set {x, y, y\ 1) satisfying the equations = 0 in a neighbor- 
hood N of those belonging to E, and for all admissible sets (Xy y, F') (x, y, y') 

and satisfying the equations <l>fi = 0. The strengthened condition of Clebschy desig- 
nated by III', is the condition of Theorem 78 • 2 with the equality sign excluded. 
The condition IV' is the positive definiteness of the second variation / 2 (f, v) 
along £, as described in Section 80. The theorem to be proved can now be stated 
as follows: 

Theorem 82-1. Sufficient Conditions for a Strong Relative Minimum. 
Let Ebea normal arc, without corners, belonging to the class III which, by definition, 
consists of all admissible arcs satisfying the conditions U E satisfies 

the conditions 1, 11'^, III', IV', it is a non-singular extremal and there exists in 
(x, y)-space a neighborhood F of the arc E such that J (C) > J (E) for every arc C 
of the class Ht in F not identical with E and having its end-points sufficiently near 
to those of E. 

It is to be noted that the theorem does not involve the assumption that E is 
normal on every subinterval but only that E is normal for the problem of Bolza 
under consideration. The stronger assumption is now unnecessary, as a result of 

^ See remarks on pp. 187-88. 
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the relatively recent work of Graves, establishing the necessity of the conditions 
II and III, and the necessity and sufficiency proofs of Hestenes related to the 
condition IV. The calculus of variations theory for a normal arc of a problem of 
Bolza has thus been brought to a stage comparable with that of simpler prob- 
lems of the calculus of variations, as far as conditions necessary for a minimum 
and conditions sufficient for a minimum are concerned. 

The theory when the normality of the arc E is not assumed is in a much less 
satisfactory state. McShane [64] has recently proved, with the help of the the- 
ory of convex sets, the necessity of the multiplier rule and the conditions of 
Weierstrass and Clebsch, with no assumption concerning normality whatsoever. 
The proof is, as yet, rather complicated. Once attained, the result is not surpris- 
ing, since these conditions are closely related first-order conditions. The situa- 
tion with respect to a second-order condition analogous to IV seems to the writ- 
er of these pages quite different. The variety of singular arcs E in the class HI is 
very great, and examples involving only a finite number of variables indicate 
that a necessary condition of the type of the condition IV is not provable for all 
cases. McShane [71, p. 529] has given an example from the calculus of variations 
to substantiate this statement. It is possible that conditions analogous to IV 
can be proved for special types of abnormal arcs £. McShane has also considered 
such cases and has found results for abnormal arcs of order one. 

In order to carry through the proof of the sufficiency theorem, a number of 
lemmas will be developed in Sections 83, 84, and 85, below. 

83. An auxiliary theorem. Let us consider three functions xi(u), 
having continuous derivatives on an interval w' g w g w", and a one-parameter 
family of extremals 

(831) yi^yiix, a), = a) 

such^that the functions yi{x, a), a), lfi{x, a) have continuous first partial 

derivatives in a neighborhood of the sets {x^ a) defined by the conditions 

Xi{u) ^x^X2{u) j a = a(u) (u'^u^u''). 

The end-points 1 and Z of the extremal arc (83 • 1) describe two arcs C and D 
when the parameter u varies, the equations of C, for example, being 
x^xx{u), yi-yi[xi{u), a{u)] 

The equations of D are found by replacing Xi{u) by ocjfw) in these formulas. The 
differentials dxy dyi along the arcs C and D are defined by the equations 
(83-2) dx ^ x' (u) du , = v*-, dx + yia da , 

with x(u) replaced, respectively, by Xi(u) and X2{u). 

Along the particular extremal arc defined by a value u the integral / has a 
value 

I{u)^f*Flx, y(x, a), y'(x, a), l(x, a)]dx, 
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whose differential with respect to « is 

dl ='[F dx]\ + da f d* . 

By the usual integration by parts of the calculus of variations one finds fur- 
ther 

dl = [F dx+Fy\yia da] \ , 

and finally, with the help of the relations (83 -2), the formula of the following 
theorem: 

Theorem 83 • 1. The valm of the integral I, taken along a one-parameter family 
of extremal arcs whose end-points 1 and 2 describe two arcs C and D, as described 
above, has the differential 

(83-3) dl= [(F-y;F,;)d*+F,;dy.]5, 

in which the arguments of F and its derivatives are the values x, yi, y[, 1(® belonging 
to the extremal arc, and the differentials dx, dyi are those of the arcs C and D. 

The symbol I* is customarily used for the integral 

I*=f [{F-y[F^)dx+F,’Jy,] . 

By integrating equation (83 *3) from to «" one obtains, then, the following 
corollary: 

Corollary 83 1. If the extremal arcs of the family {83' 1) corresponding to 
the values u' and u" are E 34 and E 56 , respectively, as shown in Figure 83' 1 then 

(83-4) I{E,,)-I{Eu) -/*(C36). 

The theorem of this section and its corollary have many applications in the 
theory of the calculus of variations.^ 



84. Fields and their construction. The definition of a field to be given here 
is one frequently used before by Bliss® and others. 

Definition of a Field. A field is a region F of iicy-space with which there is 
associated a set of slope functions and multipliers pi{x, y), k = 1, y) hav- 
< See Bliss [17], pp. 714-23. > Ibid., p. 730. 
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ing continuous first partial derivatives in F, defining elements (x, y, p) which 
lie in the region iZi and satisfy the equations ^ 0 for every (x, y) in F, and 
making the integral 

(841) i*=f[(F- dx dyt] 


independent of the path in F when the arguments of the function F(x, y, y', /) 
and its derivatives are x, yi, pt(Xy y), k = 1 , lfi(x^ y). 

If the coefficients of the differentials in the int^and of the integral (84-1) 
are denoted by 

A{x, y) Bi{x, y) 


then one can readily prove that the equations 
(84-2) "T 


are identities in the variables x and y. The partial derivatives d/dx and d/dyi 
are taken with respect to the variables x, yi which occur explicitly and in pi{Xy y), 
lfi{x, y), and the total derivative d/dx = d/dx + pid/dyi is taken in the direc- 
tion satisfying the equations 

(84-3) y). 


The conditions 




'Bit , 




’*»i 


are well known to be necessary for the integrri /* to be independent of the path. 
From these conditions it follows, with the help of the identities (84-2), that the 
solutions of the differential equations (84-3) are extremals, with multipliers 
/o = 1 , /s(x) identical along them with the multipliers fo 1 , hix, y) of the 
fidd. The extremals so defined form an n-parameter family, since one and only 
one of them passes through each point (x, 3 '). They are called the extremals of the 
field. Since the integrand oil* reduces simply to F along an extremal of the field, 
we find the following lemma: 

Lehua 84 ‘ 1 . Every fidd F is simply covered by an n-parameter family of extrem- 
als defined by the differential equations {84-3). On an extremal arc of the fidd 
I*(E) = 1(E). 

There are a number of methods of constructing fields, only one of which is 
needed for the sufficiency proofs of this chaptor. It is justified by the proof of the 
following lemma: 

Lsuha 84-2. LdEbea non-singular extremal arc having a conjugate system 
of solutions Uik(x), Vik(x) (k » 1 n) of the accessory equations along £ with 
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determinant | Uik | different from zero on £. Then £ is an extremal of a field F having 
an n-parameter family of extremals 

flli •••! On)> Zi ^ Zi ^Xf llif •••, On) 

containing £ for values (x, a) satisfying conditions of the form 

Xi^x^xt, at = 0 (k ^ I, . . . , n) . 

The functions Yi, Yu, Zi, Ziz have continuous first partial derivatives in a neigh- 
borhood of the values (x, a) belonging to £, and the variations of the family along £ 
have the values 

(84-4) 0) 0) = Y«(*). 

To prove this consider first the 2n-parameter family of extremals 
(84 5) y.*y<(*, a, b), Z 4 = Zi(x, a, b) 

in which the extremal arc E is imbedded for values 

^10 X X 20 1 a^ “ Oio f bi ^(0 ™ f 1 • • ■ 1 

and whose properties are described in Theorem 75 • 1 . Without loss of generality 
it may be assumed that the constants a<, b, are the initial values 
(84-6) o< = y.(*oi a, b), i< = z<(*o, a, b) 

at some value Xo on the x-interval belonging to E. From equations (84-6) one 
finds 

5<* = yi<,*(*oi a, b), 0 = Zi„^{xo, a, b) , 

( QA , 7 \ 

0 = y<5t(*o, a. b), 5 ,i=z, 64 (xo, a, b), 

where da = 1, 8i* — 0 for i^ k.H we define two functions A (a), B(a) by the 
equations 

I A (tt) “ 2o,*Qtti“4" U,‘k(xo) AtUk } 

(84-8) 

ZB (ft) ““2^,0®*"^ Yfjfc ( Xfl) ft,*ftjfc , 

the family (84-5) with = Aai, bi — Bat becomes an »-parameter family of 
extremals 

V< ~ yiCx, Aar Ba) “ Yi (x, ft}, • • • , ft*) i 

(84-9) 

Zi “ Zi(x, Au) Ba) ~Zi (x, fti, • • • , ftn) 

containing the arc E for (fti, . . . , ft,) = (0, . . . , 0) and having, with respect to 
the parameter Ok, the variations 

Yia^^ yiojU +yt»y Yyt(Xo) , 

* ZtayUyjb(Xo) "i" S«6y Yyt(Xo) 

(* = 1. . . . , n). 

For the arc E these variations have at xo the initial values UotCxo), Vik{xo) 
(f 1, . . ff), on account of the relations (84 -7); and th^ satisfy the canonical 
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accessory equations (7S • 14), as was seen in Section 75. Hence, along the arc E 
they are identically equal to Uik(x), Vikix) (f = 1 , . . . , »), as stated in the 
lemma. 

Since the determinant | = | Z7»jfc(^c)| is different from zero along E, the 

existence theorems for implicit functions tell us that there is a neighborhood F 
of the arc E in .ry-space in which the first n equations (84 • 9) have unique solu- 
tions ak(Xf y) vanishing on the arc E and having continuous first partial deriva- 
tives in F. If the multipliers of the family (84 • 9) are denoted by Lfi{x, ai, . . . , an), 
then the functions 

Pi(x, y) = YiAxy a(x, y)] , 

(8410) 

h(x, y) [x, a{x, y) ] 


have continuous first partial derivatives in f. 

It remains to show that in a sufficiently small neighborhood F the integral I* 
formed with the slope functions and multipliers (84*9) is independent of the 
path, and hence that f is a field. On the hyperplane x = xo 

I* ^ fFv\dyi=^ JZr[xQ, a(xo, y)]dYi[x^, a(:ro, y) 1 . 


From (84-6), (84*7), and (84*9) it follows, further, that on this hyperplane 
J* [bio+ (xo) ay] UxkixQ) dak 

~ ^ d [ hiQ JJ xk ( ^o) ^ Yxj (,Xq) l/iki ^o) ayd/c] • 


The last expression is justified by the fact that for the conjugate system Uik, Va 
the equations UijVxk — VxjUxk = 0 are identities and the matrix l|FiyI7»ifc|l is, 
therefore, symmetric. Hence I* is independent of the path on the hyperplane 
X = xq. An arbitrary arc Da% in a sufficiently small neighborhood F of E deter- 
mines a one-parameter family of extremals which cuts the hyperplane x = Xo 
in an arc C 36 , forming a configuration similar to that pictured in Figure 83*1. 
From Corollary 83 • 1 we see that 

(Z>46) = /♦ (C 35 ) +/ (^66) - I (£ 34 ) . 

Each term on the right is completely determined by the end-points 4 and d, 
since /* is independent of the path on the hyperplane x = Xo. The integral I* 
is therefore independent of the path in the whole neighborhood F, and F is a 
field with the slope functions and multipliers (84 *10). 

85. A fundamental sufficiency theorem. In afield the integral I* taken along 
an arc C in («, y)-space is a function of the coordinates xi, yii, xzy Xi 2 of the end- 
points of C only, so that the same is true of the function w defined by the equa- 
tion 

(851) w(xu y.i, *j, y,*) */*(C) ^(* 1 ), *a, 
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The sufficiency proofs to be given in later sections are based upon the following 
theorem: 

Theorem 85 - 1. A Fundamental Sufficiency Theorem. If an arc E is an 
extremal of a field F in (x, yYspace and satisfies the condition 77n of Weierstrass, 
and if the function w(xi, yn, X 2 , yi 2 ) defined by equation {85 • 7) has a proper mini- 
mum at the ends of E in the class of sets (xi, y^, X 2 , yi 2 ) satisfying the equations 
}pfi = 0, then J(E) is a minimum in the sense described in Theorem 82' 1, 

The proof of this theorem is simple. On account of the assumption that the 
arc E satisfies the condition II,v, the field F can be restricted to be a neighbor- 
hood of E so small that all the elements [x, y, pix, y), /(x, y)] belonging to F lie 
in the neighborhood N. Then at every point (x, y) of F the inequality 
E[x, y, p{Xy y), l{x, y) , Y'] >0 

holds for every admissible set (jc, y, F') {x^ y, p) which satisfies the equations 

<l>fi = 0. Furthermore, we can choose a neighborhood N of the ends of E in the 
space of points te, y^, X 2 , yi 2 ) so small that w{xiy y^i, :r 2 , yt 2 ) > w(E) for every set 
(^1, y%h ^ 2 , yiz) in N and not at the ends of E, where w{E) represents the value of 
w at those ends. Then, since I*{E) = 7(£), by Lemma 84 - 1, it follows that for 
every arc C with the properties described in Theorem 82 • 1 
J{C) -J{E) ^nC) +g{C) -I{E) -g{E) 

= I{C) -/*(C) +w{C) -w{E) 

= f E[x, y, p{x, y), l{x, y), y'] rfx 

^ +k(C)-w(£)]. 

Both terms on the right are positive or zero. The Integral vanishes only when 
yl = px{^y y) everywhere on the arc C, in which case C is an extremal of the 
field. When the ends of C are sufficiently near to the ends of E the difference 
w{C) — w{E) is always positive unless the ends of C coincide with those of E. 
The difference J{C) — J{E) is, therefore, positive unless C coincides with E, 
since there is one and but one solution of the equations (84 -3) through an end- 
point of E, Thus the theorem is proved. 

In order to determine whether or not the value w{E) has the minimum prop- 
erty described in the theorem, one may make use of the criteria for a minimum 
of a function of a finite number of variables in Section 76, above. The auxiliary 
function analogous to the function F of that section has the form W = w + e^rp^] 
and its first two differentials with respect to xi, y,!, X 2 , yi 2 as independent varia- 
bles are 

(85-2) dW^[{F- p,Fy',) dx +Fy>^ dy,] ? + ijf + , 

(85-3) dW^2y[dxu dx^, + (ayiSa.)? , 

where iyi and Szi are now simply symbols for the expressions 
(85-4) iyt^dyi — Pidx , S Zi = dFy'^ — Fy.dx . 
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Hie value for dW is an immediate consequence of die definitions of the invariant 
integral I* in (84-1) and of the function w in equation (85 ■ 1). The formula for 
d^W can be verified with the help of the definitions of Syt and Szi in equations 
(85-4), the definition of the quadratic form 2y following equation (80-5), and 
the easUy provable equation 

d(F- PiFy>.) -F,dx+Fy.dyi - PidF^-^ , 

in which the arguments of F and its derivatives are x, y, p(x, y), l(x, y). 

In a field such as is described in Lemma 84 ■ 2 the values of the functions Byt, 
SZi in the formula (85-3) foi* d^W taken along the arc £ will presendy be seen 
to be 

(85-5) iyi = Vik{x) dak , 8 *< ™ V,-*(*) da* , 

where the differendals 

(85-6) dak = aktdx-{-akkidyi 

are the differendals of the solutions at(x, y) of the first n equations in Lemma 
84-2. The first n of the equations (85-5) are immediate consequences of the 
definition of Byi in (85-4) and the equations 
y<= Ff [*, a(*, y)] , 

dy^ = Yitdx + F,«jdat = Pidx-^ Uikdat 

deduced with the help of equations (84- 10) and (84 -4). The last n equations 
(85-5) follow from the second equation (85-4) and the identities 

Ziix, a) =£,'.[*, Yix, a), F'(*, o), L(x, a)], 

ZiA», o) F(*, o), Y'(x, o), L{x, o)], 

Zi,dx +Zi«jda* *= , 

when use is made of equations (84-4) and (84- 10). 

V^th the help of these results the following lemma can now be proved: 
Lemma 85-1. Let f be a field simply covered by an n-parameter family of ex- 
tremals 

y< “ F^ (X, fti, . . . , On) , Z{ ^ Z% (^J, tti, • . • , On) 

containing a particular extremal arc E for values xi ^ x ^ xi, at 0 and having 
the variations 

Yi(x, 0) - Uik(x), Zi(x, 0) - Vik(x) 

along E, as described in Lemma 84-2. Then the value w(£) wiU be a proper mini- 
mum if E satisfies the multiplier ride of Theorem 74-1 with constants and if, 
further, at the ends of E and with the constants e,, the condition 

(85 7) ' Uikixi)ak, (t, Ua(*t) bkj + li{i(xa) bfVik(xt) bk 

- Uu(xi)aiVik(xi)ak>0 
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is satisfied for all sets ({i, ak, fe, bk) 5«* (0, 0, 0, 0) satisfying the equations 
(85-8) %[h, 

This follows with the help of the theorems of Section 76. The expression 
(85 • 2) tor dW vanishes identically in dxi, dya, dxt, dya when E satisfies the multi- 
plier rule with the constants e„. The value (85 -3) of the differential d^W is the 
expression in the first member of the inequality (85 * 7) when 

(85‘9) ii^dxi, Oib=[daik]*, 5*= [da*]*, 

as one sees from equations (85 -5) and (85 -6). With these same substitutions 
the equations (85 -8) become the equations = 0. Since the determinant 
I Vii(x)\ is different from zero along E, the same is true of the determinant 
I Bkyt I for the solutions at of the first equations (84 - 9) . It follows that by means 
of equations (85 • 9) each set dxi, dy,i, dx*, dya determines uniquely a set f i, a*, (j, 
bk, and conversely. From the hypotheses of the lemma it follows, then, that the 
quadratic form d^W is positive for all sets (dxi, dya, dx 2 , dya) ^ (0, 0 , 0 , 0) sat- 
isfying the equations d^,, = 0, and therefore that w(E) is a proper minimum. 

86. The second variation for problems with separated end-conditions satisfy- 
ing also the non-tangency condition. According to the definition in Section 69, 
the function g and the end-conditions = 0 for problems with separated end- 
conditions have the forms 

g = 1*2, y{xt)], 

'I'Axi, y(*i)] =0, y(*2)] =0 

(p = l, ..., r; ..., P) • 

The arc £ to be considered in this section is supposed (1) to be an extremal of 
the class III with multipliers of the form h = 1, le(x ) ; (2) to satisfy the non- 
tangency condition; and (3) to be non-singular. The properties (1) insure the 
existence of the second variation along E. Assumption (2), concerning the non- 
tangency condition, implies that at the ends of E the matrix 

(86-1) U,..i+iPk.ii + y'iixt) II - ll^.i, %.i\\ 

is of rank two, according to the definition in Section 69. When the end-condi- 
tions are separated, this is equivalent to saying that each of the matrices 

ll’«'...||. Il'^-.ll 

is of rank one. On account of assumption (3) concerning the non-singularity of 
the accessoiy canonical variables and equations are well defined. 

The second variation is now found, without difficulty, to have the form 

/2($, i 7 (»i)] — 272[£2, riix%)] + f * 2 (a dx , 
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where 271 and 27 a are homogeneous quadratic forms in their arguments. The 
accessory minimum problem has end-conditions of the form 

^p[ti. v(xi)] =0 (p = 1, , , . , f) , 

ri(x2) ] =0 ((r=r+l p) ; 

and the accessory transversality conditions are also separated into the following 
two groups of identities in drin, drii2 

(86-2) — ft (iXJi) + d7i+6pd^p = 0 , 

(86-3) “ ft ( 3 C 2 ) di}t2+ d72+€4rd^tf = 0 , 

where p and a have the ranges indicated above. 

It is not, in general, possible to find solutions rn, ft of the canonical accessory 
equations and constants (2 which satisfy simultaneously the end-conditions 
^p = = 0 and the conditions imposed by the transversality conditions (86 • 2 ) 

and (86*3). But we may now prove that there is a family of solutions which 
satisfy the end and transversality conditions at the initial point 1 of £, and a 
second family which similarly satisfies the conditions at the end-point 2 of E. 

If derivatives with respect to the variables and rjn are indicated by sub- 
scripts 1 and il, respectively, the conditions Sl'p = 0 and ( 86 * 2 ) are equivalent 
to the « -f r + 1 linear equations 

7i»i + 0 » 

(86-4) — fi ( a!;i) + 7 i»ii +€p^p.ii = 0 , 

^p = 0 

in the 2« + r -f- 1 variables { 1 , rjiM, fite), €p. Equations (86*4) are independ- 
ent, because the rank of the matrix of coefficients of the functions 4^p is r and the 
non-tangency condition requires at least one of the coefficients 4^p,i to be differ- 
ent from zero. Hence there are exactly n linearly independent sets of solutions 
of equations (86 -4) of the form 

^Iki » ftfcC^l) • €p* ~ I» • • • > W) • 

One may verify, with the help of the non-tangency condition and the independ- 
ence of the functions 4^p, that no non-vanishing set of the form ({ 1 , rji, ft*, €p) = 
(ti, 0, 0, €p) can satisfy the equations (86 *4), and hence that the sets Vihixi), 
tikixi) (A « are linearly independent. As initial values they define a 

system of n linearly independent solutions tjikix)^ ^ik(x) of the accessory canoni- 
cal equations. These solutions form a conjugate system, since, when (£ 1 , rn, fi, Cp) 
and ^ 1 , rji, ft, ip) are two solutions of equations (86*4), it follows from those 
equations that sLtx=^xi 

~ Vi ft + £iTi .1 + ViJi * 0 , 

““ Vi ft + f i7i.i + Viyiiii =“ 0 
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in which strokes indicate always stroked arguments. But with the help of a well- 
known property of quadratic forms it is evident that these relations imply the 
equation — f i + f < = 0, which shows that the two solutions 77,, f * and 

are conjugate. 

In the same manner, one may determine a second conjugate system of solu- 
tions Uikix), Vik{x) (fe = 1, . . . , w) of the canonical accessory equations, corre- 
sponding to a set 

^ 2 *, Vik{X2), €ak ft) 

of n linearly independent solutions of the set of conditions found by taking the 
equations = 0 with the further conditions implied by the identity (80 -3). 
If in the expressions 

= riiix) = Gj rj.jix) (xi^x^xti), 

( 86 - 5 ) 

^2==^2kbk, Uiix) = bkU,k(x) (X2^X^X2) 

the constants a, b satisfy the conditions 
(86-6) ^bkU.kix^) 

at an arbitrarily selected value xz on the interval xiXz, then the set (i, fa, 'nt(x) 
defined by the equations (86 • S) on the whole interval XiX2 is a set of admissible 
variations along E, The value of /aCf, *?) on this set is 

2Ti[fi, vixi)] + J' *2o)ix, ft, 'n')dx 

( 86 - 7 ) 

— 272 [{2, uix2) ] + f * 2 o(x Uy w') dx . 

•'*1 

In order to evaluate the sum of the first two terms, it should first be noted 
that by means of equations ( 81 - 4 ) the functions rjiy fi determine multipliers \0(x) 
with which the functions rji satisfy the accessory equations (81 • 1) and that the 
integrand of the first integral in the last expression can therefore be changed to 
2 Q(Xy 17, Tj'y X) by adding the sum By means of the relation 

( 86 * 8 ) 212 = 9 

in which 12x^ = = 0, and with the help of the usual integration by parts, the 

first two terms in the expression (86 * 7 ) are then seen to have the value 

( 86 * 9 ) — Vi(^i) + 2 yi -hep^ + i't ( Xs) Vi(xs) , 

in which the terms €p vanishing by (86 * 4 ), have been arbitrarily added. The 
first equation (86 -2) is an identity in the variables dfi, drfn and will continue 
to be true if these differentials are replaced by fi, rja- It follows that the first 
three terms in (86 - 9 ) add up to be zero, so that the value of the first two terms 
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in the expression (86 ■ 7) for Jt is the sum q i(«j) i'<(«j) . Similarly, it can be shown 
that the last two terms in (86-7) have the value — Vi(xj) udx^, so that 

/«“»Ji(*l) ?<(*t) -«<(*s) Vi(Xf) 

(86-10) 

“f<(*i)«<(*i) -»?. (*i) »<(*»). 

since equations (86-6) at « » imply that ViM ™ *fi(xt). The last expression 
is the value of the second variation along the set of admissible variations 
(u (i, ViM defined on the whole interval XiXt by the equations (86 ■ 5) and (86 - 6) . 
The value (86-10) of Ji must therefore be ^ 0 when Ji is non-negative. We 
can now state the following theorem: 

Theorem 86-1. Consider a problem of Bolsa with separated end-conditions, 
and let "E, he an extremal arc of the class lit for the problem, with multipliers of the 
form lo ~ 1, Ia(x). Furthermore, suppose that E satisfies the non-tangency condi- 
tion and is non-singular. Then there exist two conjugate systems of solutions of the 
accessory canonical equations, rm, i*!] and uik, Vik, determined by the end and trans- 
versality conditions td the end-points 1 and 2 of E, respectively. If the second varia- 
tion Ji((, q) along E is non-negative, the bilinear form 

(86-11) ajilij Uik — riij Vik) bi, 

is non-negative for all non-vanishing sets of constants aj, bk which satisfy tite 
equations 

(86-12) as riiiixt) = 6*«rt(*i) 

at an arbitrarily selected value x, on the interval xixj. 

The theorem is an immediate consequence of the arguments in the preceding 
paragraphs, and in particular of the value (86-10) of the second variation. 

Corollary 86- 1. 7/ in Theorem 86 1 the second variation is positive definite 
and the arc E normal, then the two conjugate systems of the theorem can be chosen 
so that the matrix of the bilinear form (86 - 77) is the identity matrix, and the bilinear 
form aibi will be nod-negative for all non-vanishing sets ai, bi satisfying conditions 
of the form (86-12). 

To prove this we note first that the elements of the matrix in (86-11) are all 
constants, as was shown in the second paragraph following Lemma 81-3. Fur- 
thermore, the determinant of the matrix is not zero under the hypotheses of the 
corollary. Otherwise there would be a set of constants a/, not all zero, such that 
the solution q< aflia, {i ^ ajfu of the accessory canonical equations would be 
conjugate to each of the n solutions ««, and would therefore be linearly ex- 
presable in terms of them in the form bkUik, bkVu- Thus the solution qi, ft would 
satisfy all the accessory end and transversality conditions; and with suitable 
constants |i, the set (i, |t, q<(*) would make A vanish. From the positive 
definiteness of A it would follow that q< ■ 0, which is impossible, since the as- 
. sumption that the arc iS is normal implies that thme is no non-vanishing so- 
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lution rji, of the canonical accessory equations satisfying the accessory end 
and transversality ponditions and having ly,* s 0. 

Since the determinant of the matrix in (86 • 11) is diflFerent from zero, the con- 
jugate system rnj, f*,- can be transformed by a linear transformation with con- 
stant coefficients into a new system which with Utk, Va has the identity matrix 
as the matrix in (86 11). The value of the bilinear form in (86 * 11) must 
be non-negative for all non-vanishing sets of constants a,-, bi satisfying condi- 
tions of the form (86 - 12), on account of the positive definiteness of the second 
variation. 


87. The sufficiency theorem for problems with separated end-conditions 
satisfying also the non-tangency condition. To the hypotheses of the sufficiency 
theorem. Theorem 82 • 1, we now add the assumptions that the problem of Bolza 
under consideration has separated end-conditions and that the arc E to be stud- 
ied satisfies the non-tangency condition. These restrictions will be removed in 
later sections. The first of the lemmas of this section gives a simple algebraic re- 
sult. The essential remaining step in the proof of the sufficiency theorem is con- 
tained in the Lemma 87 -2 due to Hestenes. 

Lemma 87 1. If an admissible arc E satisfies the strengthened Clebsch condition 
11 V with a set of multipliers lo, l/s(x), then the determinant {74 ‘12) formed with 
these multipliers is different from zero at every point of E. 

Otherwise at some value x there would be constants tt*, X^, not all zero, satis- 
fying the linear equations 


= 0 . 

The constants tt* could not all be zero, since the matrix ||<^ 7 vill has rank m and 
the equations therefore imply that all the numbers \y are zero when the con- 
stants Vk all vanish. The equations also have as consequences 

Fy'^TiTk = 0 , (bfiykTTk = 0 , 

which is impossible when the condition III' is satisfied along E. 

Lemma 87 *2. Let E be an arc satisfying the hypotheses of Theorem 86 1 and 
Corollary 86' 1. Then there exists a conjugate system of solutions Uik(x), Vikfx) 
(k = 1, . . . , n) /Ae canonical accessory equations along E, with determinant 
I Uik(x) \ 9 ^^ on the interval XiX 2 belonging to E, and such that the inequalities 

Uii{xi)aj] -Uij{xi)aiVik{xi)ak>0 , 

(871) 

““272[f2, Vij{x2) bj] ’\-Uij{x^) bjVik{Xi) bk>0 


hold for all non-vanishing sets (fi, a), (fe, b) satisfying the equations 


(87-2) 


Uii{xi)ai] =0 , 

Uii{X 2 ) 6y] = 0 , 


the notations being those in {80 -5) and {71-7). 
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To prove the lemma consider the conjugate systems 17 , 7 , and in 

Corollary 86 * 1 , which have the property 
(87-3) djk • 

Then the system which will be proved to have the properties of the lemma is 
(87*4) Uik = ritk-huik , Vik = f»ifc+ Vik • 

This system is evidently a conjugate system, since 

VijU ik ““ Uty Vik ~ itijVik Vij^tk^ "4” “ Vij^ik) 

+ ( V.jriik - Wzyftfc) + ( VijUik - UijVik) . 

The first and fourth terms on the right are zero because ly,/, f ,7 and Vik are 
conjugate systems, and the second and third cancel because the matrix in 
(86 - 11 ) is now the identity matrix. 

The determinant 1 Utk(x) 1 is different from zero on the interval X1X2. Other- 
wise there would be a value x^ and a set of constants oy, not all zero, such that 
Uij ( Xz) aj = rjij ( Xz) Oy + ( Xz) ay = 0 . 

The set (oy, bk) = (ay, —a*) would then satisfy the equations ( 86 * 12 ) 2 it x — 
and would give the bilinear form (86 * 11 ) the negative value —ayay, contrary to 
the property of this form described in Corollary 86 * 1 . 

It remains to show that the conditions (87 *1) are satisfied. It is only neces- 
sary to prove the first of these inequalities, subject to the first condition (87 • 2 ), 
since the proof of the second is similar. To do this we use the notations 

Vik^k > ~ » 


Ui = Uikdk , Vi = Vikdk , 

Ui“U ikdk > Vi ^ V ikO'k • 


The values fi, Ui{x^ satisfy the first set of equations (87 *2) by hypothesis. Be- 
cause of the non-tangency condition it is possible to find auxiliary constants 
(fa, €p) which with (fi, i 7 ii) = [fi, U,(rri)] satisfy equations (86*4). For conven- 
ience in making the proof we may use the notations {xi) for the first n 

of these new constants. Let ^i(x), Ji{x) be the solution of the canonical acces- 
sory equations which has initial values at Xi defined by the equations 


(87*5) 


Vliixi) +rii{Xi) = UiiXi), 
tiixi) +f»(a:,) = Viixi) . 


Then $i, f i also satisfy equations (86 * 4) with suitable constants Ji, ip, since these 
equations are linear. It follows that there are constants dk such that 


fliix) — riik ix) Gk 9 i^i(x) — f tjfc (^) ajfe . 

From the homogeneity of 271 , equations (86*4) with [{i, tii{xi), fi(*i)] = 
Ui{xi)f Fi(aPi)] and suitable values of the constants Cp, and further from 
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equations (87-4) and (87-5), we find that the first member of the first in- 
equality (87 ' 1) has the value 

fi7..i + U<(*,)7i.<i-U.(*i) Vi(xi) - Vi(xi) Uiix,) -Ui(xi) K<(*t) 

= (fi + fi) ivi+^i) — ivi+^i) (f<+ Vi) , 

in which x = Xi everywhere. From equations (87-4) and (87-5) it is clear that 
so that the last expression is equal to 

(fiWi ~ ViVi) + ~ ViVi) + (ftlji ~ Vifi) ~ + . 

The two terms on the right are tlie values of the first two parentheses, by equa- 
tioi^ (87*3); and the third parenthesis vanishes since the solutions tn, and 
rji, are members of the same conjugate system. The sum a«ai is positive or 
zero because of the equations 

Vii(xi) dj-'^i(xi) ^Uiixi) =««(xi)a* 

and the property of the quadratic form (86- 11) stated in Corollary 86-1. The 
constants at are not all zero, since otherwise the first equations (87-2) and the 
non-tangency condition would imply also = 0, contrary to hypothesis. Con- 
sequently, the first inequality (87 - 1) is valid, and the proof of Lemma 87 - 2 
may be regarded as complete. 

It is now possible to conclude the proof that the hypotheses of the sufifidency 
theorem, Theorem 82 ■ 1, justify its condusion for the case when the end-condi- 
tions of the problem of Bolza are separated and the non-tangen(y condition is 
satisfied. An admissible arc without comers which satisfies conditions I and III' 
with a set of multipliers fo = 1, hix), must be a non-singular extremal arc, ac- 
cording to Lemma 87-1, Corollary 74 *3, and the definition of an extremal in 
Section 75. The condition IV' and the non-tangency condition, in addition to 
the others, imply the existence of a conjugate system of solutions !/,•*(*), Vnix) 
(k = 1, . . . , ft) of the canonical accessory equations along E with determinant 
I Utt(z;) I different from zero on the interval xixt belonging to E, and satisfying the 
inequalities (87*1) under the conditions (87*2), according to Lemma 87*2. In 
the field constmcted with this conjugate system and imbedding the non-singular 
extremal arc £, as described in Lemma 84*2, the function w(xi, ya, xt, ya) de- 
fined in (85 * 1) has a minimum at the ends of E. This follows from Lemmas 85 * 1 
and 87 * 2, since for the case of separated end-conditions the quadratic form in the 
condition (85 * 7) is the sum of the first members of the inequalities (87 * 1) and 
the conditions (85*8) are equivalent to the conditions (87*2). From these results 
and the condition Iljlr, assumed in Theorem 82 * 1 to hold along E, it is clear that 
all of the h}qx>theses of Theorem 85 * 1, the fundamental sufficiency theorem, are 
satisfied by the arc E; and the conclusion of Theorem 82 * 1 is therefore justified. 

88. Sufficiency theorems for problems with end-conditions unrestricted. 
It was shown in Section 69 that every problem of Bolza with differential equa- 
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tions, end-conditions, and sum J in the usual forms (69*2), (69*3), and (69*4) 
is equivalent to another similar problem of Bolza having separated end-condi- 
tions and satisfying the non-tangency condition. The differential and end-con- 
ditions for this second problem are shown in (69* 10). We may designate these 
two problems by the letters P and 0, respectively. The purpose of this section 
is to show that the two problems are equivalent and that, if the hypotheses of 
the sufficiency theorem, Theorem 82 * 1, hold for problem P, they will also hold 
for problem Q, Thus the sufficiency theorem which was found in the last section 
for a problem like Q, having separated end-conditions and satisfying the non- 
tangency condition, will also be proved for problem P. 

The arcs involved in problem Q are defined by functions 

yiM, yn+«(») i= 1, . . . , »; o= 1, , n + 2). 

The r^ion for Q analogous to Pi for P is the totality of (4n + 5) -dimensional 
sets 

(x, y», yn+«, y<, yn+*) 

for which (x, y, y') is in Pi and the (2n -f 2) -dimensional set (y 2 n+i, yc, y 2 n+ 2 , yn+c) 
is in the region P 2 for P. The region for Q analogous to P 2 for P is the totality of 
(4» -f 6)-dimensional sets 

[Xu yiiXi), yn+aiXi), ^2, y<(X2), yn+a(Xa)] 

for which the sets 

[y2n+lixi), yi(Xi), y2n+2ixi), yn+t(:»l)] 

are in Pi. With this understanding about regions it is provable that every ad- 
missible arc C satisfying the conditions 0/8 = = 0 for the problem P defines 

uniquely an admissible arc D satisfying the corresponding conditions for the 
problem Q, and conversely. Furthermore, the sums J for the two problems have 
equal values on corresponding arcs, so that the two minimum problems are 
clearly equivalent. 

It remains to show that on an arc C of the class HI the hypotheses of the suffi- 
ciency theorem of Section 82 for problem P imply the analogous hypotheses on 
the corresponding arc D for the problem Q, and vice versa. The functions in- 
volved in the multiplier rules for the two problems have the forms 

( 88 - 1 ) G^kf+1,^, + L^y:^, 

(/J“ 1 0-1 n+2), 

respectively. When the multiplier rules themsdves are formulated with these 
functions, it is found that, if an arc D for the problem Q satisfies the multiplier 
rule for that problem, then a set of multipliers is uniquely determined with 
whidi the correspond^ arc C satisfies the multiplier rde for the problem P, 
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and conversely. Furthermore, if the multipliers are abnormal in one case, they 
are so in the other. Thus the hyp>othesis I and normality for an arc C imply the 
analogous properties for the corresponding arc D. 

The Weierstrass £-f unctions and the quadratic forms involved in the Clebsch 
conditions for the two problems can easily be proved identical, so that the con- 
ditions 11n and III' for one problem imply the analogous conditions for the 
other. 

The second variations along corresponding arcs C and D for the two 
problems are equal. This can be seen from the formula (80-3). The terms in 
parenthesis and the integrand function 2« for the problem Q reduce to those for 
problem P with the help of equations (69 - 10) and the expressions (88*1). For 
the problem Q the quadratic form 2q has the arguments dy 2 »+ife), dyite), 
rfyan+ 2 (i«i), dyn+iix^* With the help of equations (69 10) these are seen to be 
equal, respectively, to the arguments dxi^ dyi{x^^ dx 2 , dyi{x^ for the problem P. 
A similar remark holds for the quadratic form Finally, the equations of vari- 
ation for the problem Q are 

= <+. = 0 . 

*?Sn+l(*l) — l?2n+2 ( *2) ~ ^2 = 0 , 

»»„+<( *2) ->?.•( *2) -y«f 2 =o • 

These are formed from equations (69* 10) as the functions and in (71-5) 
and (71-7) are formed from and They are equivalent to the equations of 
variation = 0 for the problem P. Thus the condition IV' for the prob- 

lem P implies the analogous condition for the problem Q, and vice versa. The 
sufficiency theorem of Section 82 is therefore proved. 

The first three sufficiency theorems indicated in the table of Section 17 are 
also applicable to the problems of Bolza discussed in the present section, pro- 
vided that they are applied to arcs E which are normal and without corners. The 
theorem in the third row has just been established. The theorem in the second 
row, which states that the conditions I, IIjv, IV', E non-singular are sufficient 
for a minimum, is a consequence of the following lemma: 

Lemma 88*1. The conditions //k and IIP are equivalent to the conditions //n 
and E nonrsingular. 

It is evident, as a result of the arguments in the first paragraphs of Section 87, 
that the conditions Iljyr and III' imply IIn and E non-singular. We can prove^ 
that the conditions Il^r and E non-singular imply II^ by showing that at a set 
(«> /i where 

y, y') = 0, Mx, y, F') £(*, y, y', /, F') »0, 

4 Hestenes and Reid [62]. 
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we must have Fg = yi. At such a set the E-function with the atguments (xo, 
yo, h Vo) must have a minimtim at the values yi, lo relative to all the sets y, I 
which satisfy the equations ^ff(xo, yo, y") = 0, on account of the condition lijr. 
According to the results of Section 76, there must be a function £ -f- 0^ 

whose derivatives with respect to yi and l/i all vanish at the point (x, y, y', I, Y')o. 
At this point, therefore, we must have 

“ ( Fj “ yo) F ojoj "f" \> ® » 

-iYl-yk)4>i^’=0 . 

This is impossible unless Fq — y^, since the determinant of coefficients is differ- 
ent from zero on account of the presupposed non-singularity of the arc E. 

To prove that 11^ and the non-singularity of the arc E imply IlF, we note first 
that Ilff implies III, as in Section 78. Let xo be a set of values (tio, . . . , Tno) 
satisf 3 ring the equations 

( 88 - 2 ) « 0 , F„'f^XiXk = 0 . 

On account of the condition HI it is evident that the quadratic form has a mini- 
mum at TO in the class of sets t satisfying the first equations (88 - 2). Hence, by 
results of Section 76 again, there must be a set of multipliers such that the 
function 

has all of its derivatives zero at to. Hence at to 

itatiTk = 0 ; 

and we see, since the arc E is non-singular, that the only set of values which can 
satisfy the equations (88 - 2) is to = 0. This completes the proof of the lemma. 

An arc £ of the class IR is said to make /(£) a weak relative minimum if there 
is a neighborhood N of the sets 
(88-3) (*, y, y') 

belonging to £ such that /(O > /(£) for every arc C of the class lit not identi- 
cal with £, having all of its elements (88-3) in N, and having its end points 
sufficiently near to those of £. 

Theosem 88*1. Far a normal arc E without corners in the class III the condi- 
tions 1, IIV, 1 V' imply that J(E) is a weak relative minimum. 

We may prove the theorem first for a problem with separated end-conditions, 
and this result may then be extended to the more general case by the arguments 
in the first three paragraphs of this section. The reasoning of Section 87 is effec- 
tive with the addition of a proof that there is a nei^borhood N of the sets 
(*> y) on £ such that the condition 
(88-4) £[*, V, p{x, y), l{x, y), F'] >0 
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holds for all admissible sets (x^ y, F') 9^ (x, y, p) in N satisfying the equations 
® where p{xy y) and /(ac, y) are the slope functions and multi- 
pliers of the field constructed in Section 87 with the conjugate system of Lemma 
87 * 2 . With the help of Taylor’s formula with integral form of remainder the 
function (88 * 4 ) can be expressed in the form 

( 88 - 5 ) iriTkf\l-e)Fy^^[x, y, p + o{Y'-p), 1] dS 

with Wi = F- — Pi; and, since the sets (^, y, p) and (x, y, F') satisfy the equa- 
tions <l>fi == 0, we have 

(88*6) O^Tif <l>fiy'.[xy y, p + 6 — p)] dd . 

•'0 


According to the condition III', the quadratic form (88 • S) is positive for all sets 
(x, y, F', v) with (x, y, F') on the arc E, satisfying the equations (88 *6), and 
with ViiTi = 1 . As will be shown in the next paragraph, there is then a neighbor- 
hood N of the sets (x, y, F') belonging to E such that the quadratic form (88 ■ 5 ) 
is positive, and the condition ( 88 - 4 ) therefore satisfied for all sets (x, y, F') 
in N and sets ir 9^ 0 satisfying equations (88 *6). 

To prove the existence of the neighborhood N we note first that the totality 
of sets (x, y, F', ir) with (x, y, F') on E, satisfying the equations (88 *6) and 
having nTi = 1 , is a bounded and closed set 5 . There is therefore a neighbor- 
hood St such that the expression (88 - 5 ) is still positive for all sets (x, y, F', w) 
in St satisfying the equations (88 -6) and with TnTn = 1 . By an indirect proof it 
may be shown that there is a neighborhood N of the sets (x, y, F') on E such that 
the sets (x, y, F', tt) with (x, y, F') in N, satisfying the equations (88 *6) and 
having irtir* = 1, are all in 


89 . The second variation for problems with fixed end-points. In this section 
two lemmas will be established which will be of use in Section 90 . The accessory 
problem of minimizing the second variation /2(f, v) along a normal non-singular 
minimizing arc E without corners was formulated in Section 81 . In the present 
section we shall study the analogous problem of minimizing the integral 


f2(v) = J 2(a(x, 7j, 17') 


dx 


of the second variation in the class // of all admissible variations rji(x) satisfying 
the equations of variation = 0 and the end-conditions vtM = ViM = 0. 
The end-values xi and X2 are now supposed fixed. The arc E, along which the 
integral /2 is taken, is supposed to satisfy the conditions I and III' with a set of 
multipliers of the form /o == 1, /fl(x). 

A value x^ on the interval Xi < Xj ^ X2 is said to be conjugate to xi for this 
problem if there exists a solution «*(«), Vi{x) of the accessory canonical equations 
( 81 - 5 ) with elements Ui{x) vanishing at Xi and x^ but not all identically zero on 
the interval XiXy 
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Lemma 89 * 1 . Jl necessary and sufficient condition for LCi?) to be positive definite 
ih the class H of admissible variations nM satisfying the conditions 
( 891 ) i?i(xi)= 0 , = 0 

is that there exists a conjugate system Uik(x), Vik(x) (k = 1, . . . , n) solutions 
of the accessory equations with | Uik(x) | ^ Q on the interval XiX2. 

To prove the sufficiency of the condition one finds readily, in the first place, 
that the £-function for the integral hiyi) the form 

£u,(x, ri, IT, X, ri') —Fy'^(rii — iri) (rik — Tk) 

with (x, ri') and (x, 17, ir) satisfying the equations 

17, ri') 17, t) =<ftjy;.(i7i — iTi). 

Hence the function £« is positive for such sets when (x, 17, 17') 7^ (x, 17, ir), on 
account of the condition HI'. The ^{-parameter family of accessory extremals 
Vi = Vik ak, it = Vik akj now assumed to exist, simply covers the region of a»7- 
space between the hyperplanes x = xi and x == X2; and in that region the slope 
functions and multipliers Ti(x, 17), \fi(x, 17) of the family form a field for the in- 
tegral hiv)) according to the proof of Lemma 84 • 2 applied to the problem of the 
present section. The fundamental sufficiency theorem. Theorem 85 - 1 , applied 
to this field, justifies the sufficiency of the condition of the lemma. 

The proof of the necessity of the condition in the lemma can be made by the 
method used by Hestenes in proving the first part of Lemma 87 * 2 , but with 
some modifications, because of the fact that the minimum problem here con- 
sidered is not necessarily normal. If the order of abnormality of the axis i7i = 0 
for our present problem is then there will be a maximal set of q linearly inde- 
pendent sets of multipliers Xop = 0, X^p(x) (p = 1, . . . , ^) with wfhich it satisfies 
the accessory equations, or, what is equivalent, a maximal set of q linearly in- 
dependent solutions Vip, iip of the accessory canonical equations having vi(x) = 0 
on the interval XiX2. We consider now the two conjugate systems, 17,*, f and 
Uik, Vik = 1, . . . , n), with elements Vik and Uik all vanishing at Xi and X2, re- 
spectively, and so chosen that the solutions vip, tip (p ~ • • • > 9) are the first q 

solutions of each system. Without loss of generality we may assume that the 
sets tip M (p = 1, . . . , 9) are normed and orthogonalized and that we have the 
relations 

( 89 * 2 ) “0, (^l) v$a (^1) “0 

(ps*l, • • • f q'f a ^ q •••> • 

The presupposed positive definiteness of the integral h{v) implies the non- 
negativeness of the bilinear form (86 *11) for all sets bk^ Ck satisfying equations 
(86 * 12 ) . The matrix of constants in (86 * 11 ) has elements all zero in q rows across 
the top and in q columns on the left, since the solutions i7<p, tip (p ^ 1 9) 

are common to both conjugate systems. The remaining matrix 

( 89 * 3 ) IlftaMip— 17<* (a, /9«9 + 1 , n) 
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has determinant different from zero. Otherwise there would be an accessory ex- 
tremal fii = baViat ia conjugatc to cvery one of the systems «<*, = 1, 

. . . , n), and hence linearly expressible in terms of this latter conjugate system 
in the form CkUik, cmk. The accessory extremal f i so determined would belong 
to both conjugate systems and would therefore satisfy the conditions (89*1). 
The functions rnix) could not all be identically zero, since the set ly^p, f (p = 1, 

. . . , ^) is a maximal set of accessory extremals with elements in vanishing iden- 
tically on the interval Xioc^- Taken along the arc yii{x)y the integral h{y{) would 
vanish, as one sees by the usual integration by parts. But this is impossible 
since the integral is positive definite. Since the determinant of the matrix 
(89*3) is different from zero, we may transform the system (a = ^ + 1, 
...,») linearly so that (89*3) is the identity matrix. 

Let us now define a system U Vik{oo) (^ = 1, . . . , ») by the initial values 

( ^l) “ f ip ( ^x) I F^tp ( ^l) “ 0 I 

(89*4) 

Uia(Xi) =^Via(Xi)+Uia{Xi), ViaiXi) ^ tia(Xi) + ViaiXi) . 

With the help of the relations (89 *2) and the fact that the matrix of constants 
(89 *3) is now the identity matrix, it is easy to see that the system Z7»fc(x), Vik{x) 
(k = 1, . . . , n) is a conjugate system. Furthermore, the determinant \ Uik(x) | 
is different from zero on XiX 2 . Otherwise there would be a value x, and a set of 
constants bk, not all zero, such that 

(89*5) bkUik(x^)--0 (A*l, n). 

The constants 6p would all be zero, as one sees by multiplying the equations 
(89*4) by ^w(Xi) and adding, since from conjugacy relations and (89*2) and 
(89*4) 

fiff ( I^ip ( ^3) ” fi*’ ( ^1) j"ip ( ^1) “ bpv y 
fiaC^a) UiaiXz) =0 . 

In deducing these relations one needs to remember that the functions rupix) are 
all identically zero on the interval xix^. From (89 • 4) the constants (a = ^ + 1, 
• ..,«) would then satisfy the relations 

ba ^y<tt(^8) ~ W*a(3P8) * 

and on the broken extremal 

Vi ( Vi^ ( Xi ^ X ^ X^ y 

— —baUiaix) on Xz^X^X2 

the value (86 • 1 1) of hiv) would be — J*6«, which is impossible since hiv) is posi- 
tive definite. Thus Lemma 89 * 1 is proved. 

Lemma 89 • 2 . i 4 necessary and sufficient condition for 12(1?) to be positive definite 
in the class H is that there is no value Xj conjugate to Xi on the interval xi < x ^ X 2 . 
The necessity follows from the fact that if is conjugate to Xi there is an ac- 
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cessoiy extremal Ui(x), Vi(x) with Ui(xi) = utCx^) == 0 but with Ui(x) not identi- 
cally zero on the interval xix^. The composite arc rii(x) on XiX2 with rji(x) « Ui(x) 
on XiXi and rii(x) s 0 on 0:3*2 would then give to hiv) the value zero, which is im- 
possible since It(ri) is, by hypothesis, positive definite. 

To prove the suflSciency of the condition in the lemma, let { be the least upper 
bound of the end-values *3 of the intervals *1*3 on which the integral /2(^)i taken 
between the limits *1 and *3, is pK>sitive definite in the class of admissible varia- 
tions fii(x) satisfying the equations = 0 and having the end-values = 
ViM 0 « That there are such intervals is a consequence of Lemma 89 - 1 and 
of the fact that there exists a conjugate system Uik(x), Vik(x) (k = 1 , ...,«) of 
accessory extremals with initial values Z 7 »fc(*i) = 5 <jfe, Vik{x^) = 0 and a determi- 
nant I Utjbl consequently different from zero near *1. 

On the interval the integral h(r{) is at least non-negative. Otherwise there 
would be on this interval an admissible set of variations -mix) satisfying the 
equations = 0 and = 0 and having h[n(x)] < 0 . It will be 

shown in the fourth paragraph below that there would then be a family of sets 
of variations *3) vanishing at *1 and *3 < { and such that the inequality 
U Wxy iCj)] < 0 would hold for values *3 sufficiently near to f . This is, however, 
impossible, since on intervals *1*3 with such values *3 the integral hiv) is, by 
hypothesis, positive definite. 

On the interval the integral hiv) is not only non-negative but is also posi- 
tive definite. Otherwise there would be an admissible set of variations satisfying 
the equations = 0, vanishing at xi and f, but not identically zero between 
these values, and giving to the integral hirj) its minimum value zero. Such a 
minimizing set would necessarily satisfy the accessory multiplier rule and be 
an accessory extremal, since the condition III' implies that the arc E is non- 
singular. This is, however, impossible since by hypothesis, is not conjugate 
to *1. 

The least upper bound $ must be at *2. For the positive definiteness of the 
integral hiv) on the interval implies the existence of a conjugate system 
Uik(x)^ Vik(x) (A = 1 , . . . , n) with determinant | Uik(x) | different from zero on 
the interval *if, by Lemma 89 - 1 . This determinant will remain different from 
zero on a slightly larger interval; and by Lemma 89-1 again, it follows that ( 
cannot be the upper bound which it is supposed to be, unless it is at *2- The proof 
of the lemma will now be complete when the postponed arguments of the next 
two paragraphs are supplied. 

If the arc E on which the second variation is taken has abnormality of order q 
on an interval *i{, then it has the same order of abnormality on every interval 
*1*3 with *3 < ( and sufficiently near to (. For in the second paragraph of the 
proof of Lemma 81 *4 it was seen that the matrix 

I Uu(Xi) 

(i*!, 2») 

UiAxn) 
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of a fundamental system of accessory extremals must have rank In — g' if g' 
is the order of abnormality of E on the interval XiXy When x^ = this rank is 
2n — q; and for values x^ sufficiently near to f it is at least 2n — q, because of 
the continuity of the elements of the matrix. But the number of linearly inde- 
pendent accessory extremals ft with ly* = 0 on the subinterval xix^ is at least 
as great as the corresponding number q for the interval XiJi itself. Hence 7 ' = q. 

To prove now the existence of the family of variations ii)i{x, x^ used in the 
third paragraph following Lemma 89 * 2 , let i\x{x) be a set of admissible variations 
satisfying the equations = 0 along E and having = Tyt(f) = 0 for a val- 
ue f on the interval jti < f ^ X 2 » From Lemma 81 -4 it follows that for every 
«3 < { there is an accessory extremal Vi having Ut{x^ = "mixi) = 0 , Ui{x^ = 
riiix^. When x^ is sufficiently near to this extremal can be determined by solv- 
ing a suitably selected system of 2 « — g of the equations 

0 = C,Ux,{Xi), Viix^) = C,UiAX:i) 

for 2 » — g of the coefficients c» while the other q coefficients are chosen to be all 
equal to zero. The functions c^ix^) so determined are continuous and approach 
zero as x^ approaches The variations 

(89*7) riiix, Xi) ^rii(x) - (^ 3 ) (^) 

then vanish for x = Xi and x == x^; and riii^y ^ 3 ) and x^) approach rjiix) 
and ril(x) uniformly for x on the interval Xi^ as x^ approaches f. The family 
(89 • 7) is the one used in the proof of Lemma 89 • 2. 

90. Conditions equivalent to the strengthened fourth condition. It is not 
easy to test directly the positive definiteness of the second variation, which is 
the condition IV' of Section 82. In this and the following section it is proposed to 
establish three equivalent conditions which, theoretically at least, can be more 
easily applied. 

We shall use the notation 1 V[ to designate the condition that the second varia- 
tion is positive definite on the class of all at most once-broken accessory ex- 
tremals satisfying the accessory end-conditions. Similarly, the notation 1 V 2 will 
stand for the condition that the second variation is positive definite on the class 
of all accessory extremals satisfying the accessory end-conditions and, further, 
that there is no conjugate point x^ to Xi on the interval xi<x < X 2 . The defini- 
tion of a conjugate point is given in Section 89. 

THEORESi 90* 1. The conditions IV\ 1V[, IV^ equivalent to each other. 

It is easy to see that IV' implies IV'jand that IV( implies IV 2 . Hence the the- 
orem will be proved if we can show that IV^ implies IV'. 

To make this proof we note first that when IV^ is satisfied the value X2 cannot 
be conjugate to X\, With thi^ remark in mind, let •ni{x) be an arbitrarily selected 
admissible set of variations satisfying the equations =» 0 and the accessory 
end-conditions. According to Lemma 81 *4, there exists an accessory extremal 
Ui{x)xf>t{x)m^Ui{xi) s •mix^^Uiixt) » i 7 »(:k: 2 )* If the second variation is formed 
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with the integrand Q having the multipliers of »<, we may express it, with the 
help of Taylor’s formula, in the form 

•'*1 

The integral vanishes, as one sees by the usual integration by parts. The second 
term on the right is positive unless m = «<, by Lemma 89-2; and the first term 
is positive unless m.- ^ 0, by the hypothesis IVj. Hence the second variation is 
positive definite, and Theorem 90- 1 is proved. 

To test the condition IV^ we may use the criteria which will be described in 
this and the following paragraphs. First of all, we assume that the order of ab* 
normality of the arc E on the interval XiXt is q. A necessary and sufficient condi- 
tion for this to be true is that there exists a maximal set of q linearly independent 
secondary extremals Uifix), Vi^{x) (p = 1 , . . . , 9 ) with elements Ui^ all identi- 
cally zero on * 1 **. Let «<«(*), o<a(*) (a = 9 -|- 1, . . . , 2 «) be 2» — 9 further ac- 
cessory extremals forming with Vip a fundamental system. The equations 
(901) '*'/.[{!, C.UiaiXi), {», c.«, •.(*»)] =0 

have a maximal set of 2 « — 9 -|- 2 — r solutions ({ 1 , c,), where r is the rank 

of the matrix of coefficients of these constants in the equations (90- 1). Corre- 
sponding to the solutions so determined, there is then a maximal set of constants 
( 1 , (i and accessory extremals 

{iTi {w. Virix), j^irix) ( 7 = 1 , ..., 2n-q + 2-r), 

each of which satisfies the accessory end-conditions and has elements rnyix) not 
all identically zero on the interval XiXt. The value of the second variation, taken 
on the elements 

Vi~Virfhr 

of an arbitrary linear combination of these, is a quadratic form Q(z) in the varia- 
bles Zy. The following result can then be readily verified: 

Theoreu 9Q-2.The condition 1 F, is equivalent to the conditions that the quad- 
ratic form Q(z) is positive definite and that there is no value x, conjugate to xi on the 
interval xi < x < xj. 

If the original extremal arc E is normal for the original problem of Bolza, 
then every accessory extremal arc satisfying the accessory end-conditions will 
be normal for the accessory minimum problem, as indicated in Lemma 81 ' 1. 
According to Theorem 77 ■ 1, applied to the accessory minimum problem, there 
will then be an infinity of admissible sets ( 1 , (t, «;<(«) satisfying the accessory 
end-conditions. Both the condition rVj and the equivalent conditions of The- 
orem 90-2 imply that the values Xi and xt are not conjugate. Hence, according 
to Lemma 81 *4, there is an accessory extremal arc «<(«), Vi{x) joining the ends 
of the arc defined by each set of functions 9 <(«), and the set ( 1 , { 1 , Ut(x) will also 
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satisfy the accessory end-conditions. Thus the quadratic form Q(z) must con- 
tain variables z. 

To determine the values conjugate to xi on the interval xxX 2 f we may fol- 
low a procedure of Reid [44, p. 577]. On each interval XiX the arc E has an order 
of abnormality, which we may denote by qix) and which is equal to the number 
of linearly independent accessory extremals Vi with elements Ui all vanishing 
identically on XiX, The function q{x) so defined is non-increasing and has only 
integral values ^ m, according to Theorem 77 *2. Hence it has only a finite 
number of discontinuities on the interval X 1 X 2 , which will be denoted by ta 
(a == 1, . . . , g — 1) with to — Xi < h < . . . < tg^i < tg ^ xo. The accessory 
extremals Ui, Vi with elements Ui all vanishing at xi form a conjugate system 
Uik(x), Vik(x) (A = 1 , . . . , n) whose elements Vikixi) may be normed and ortho- 
gonalized so that 

(90*2) Vii{xi) Vik{xi) ^ bjk = 0 if 

Furthermore, the order of the columns in the system may be so chosen that for 
each a = 1 , . . . , g the first q(ta) of them are the ones whose elements Uip 
[p = 1, . • * } q(ta)] vanish identically on the interval xi^ x ^ ta^ A fundamental 
system of accessory extremals can be formed by adjoining n new accessory ex- 
tremals Ui,n^k(x)^ Vi^n’^kix) (k ^ determined by the initial conditions 

(90*3) ^Vik{Xi)y v.^^^ixi) = 0. 

Finally, for each value a — 1, .. .,q we may define a system Uik(x\a)f a) 
by the equations 

Uig{x\a) Vigix\a) ^ 

[ p = 1 , • • • 9 9 ( 1 

Ui^{x\a) »»ir(x|a) -Vwix) 

[ (T = 9 ( ^®) •••> • 

The system so determined is a conjugate system, as one verifies with the help of 
the relations (90*2), (90*3), and Uig{x) s 0 . With these definitions and nota- 
tions agreed upon we may prove the following theorem: 

Theorem 90*3. For each value a — 1, . . . , g values conjugate to xi on 
the interval xi < x ^ ta are the values xs when any one of the following conditions 
is satisfied: 

( 1 ) The matrix 

ll««(*i) «<.,+*(*,) I 

i ««(*») 

has rank less than 2n — q(ta); 

( 2 ) The matrix ||ui,(xj) || [a = q(t.) + 1, . . . , n] Aoj rank less than n — q(t.); 

(3) The determinant | Uik(xj | a) | u ejMo/ to zero. 

Every accessory extremal «<, «< is expressible linearly in terms of the funda- 
mental system whose elements occur in the matrix in (1). The elements in the 
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first q{it) columns of this matrix are all zero. It can readily be seen, then, that an 
accessory extremal «<, exists with «»(xi) = Ui{x^) = 0 and Ui{x) f^Oon xiXj 
if, and only if, the matrix has rank less than 2n — q{U)> 

Every accessory extremal «,*, Vi with elements Ui all vanishing at Xi is expressi- 
ble linearly in terms of the conjugate system «**, (A = 1, . . . , n). Its elements 
Ui will vanish at x^, and not all be identically zero on Xi Xj, if, and only if, the 
matrix in (2) has rank less than n — qiU). 

The determinant in (3) will vanish when x^ is conjugate to xi by the criterion 
stated in (2). Conversely, if the determinant vanishes, there will be a set of 
constants Cky not all zero, satisfying the equations 

CkUik{xz\a) =0 . 

If this equation is multiplied by Uip{x^ [p = 1, . . . , ^(/a)] and summed for i, it 
follows from the construction of the conjugate systems «,*, v** and «**(*! a), 
1, . . . , «) that the first q{t^ of the constants Ck are zero. It is clear, 
then, that the matrix in (2) has rank less than n — ^(/a), and the theorem is 
proved. 

9L Boundary-value problems associated with the second variation. Re- 
lated to the second variation there are so-called “boundary-value problems*’ 
leading to still other conditions equivalent to the condition IV. One of these, 
analogous to similar boundary- value problems well known for simpler cases, will 
be exhibited here.® The details of the arguments of the following paragraphs go 
back to Hickson,® who applied them to problems of the calculus of variations 
having fixed end-points and no differential or other equations of condition. For 
the problem of Bolza a condition like that given in this section, equivalent to 
condition IV of Section 90, was formulated by Cope [16, p. 21), and an abbrevi- 
ated discussion was given by Hestenes [38, p. 812). A much more detailed ac- 
count has been published by Reid [33, 44). In the paragraphs below, the methods 
used hitherto are somewhat modified. The new condition equivalent to IV will 
be denoted by the symbol IVj. 

In the developments of this section it is understood that the arc E of the class 
ni on which the second variation is taken is normal, has no corners, and satisfies 
the multiplier rule I and the strengthened Clebsch condition llV, The arc E is 
therefore a non-singular extremal, as one sees with the help of Lemma 87 *1; 
and its multipliers taken in the form /o » 1, /p(x) are unique. 

The boundary-value problem of the second variation along £, to be consid- 
ered here, is related to the problem of finding, in the class of admissible sets qi{x) 
satisfying conditions of the form 

(91*1) ^^(^1 ^ 0 > ^(^ 2 ) ] *0 

Xi *» const. , Xi » const. , 

> See Bliss [57]. 

® “An Application of the Calculus of Variations to Boundary Value Problems,” Transac- 
Hans of the American Mathematical Society, XXXI (1929), 563-79. 
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one which minimizes the sum 


(91-2) /.(u, «r) 


( 2 « — (Tiyiiy,*) dx . 


For the present the functions 27 , are supposed not to depend upon the usual 
constants and ( 2 , a restriction which will be removed later. The symbol <r is 
a constant. 

The equations characterizing a minimizing set rii{x) for the problem just pro- 


posed are 

dOij 

0 

ii 

(91-3) 


1 + d 7 + €|. , 

0 

II 


where the next to last equation is, as usual, an identity in the differentials drm^ 
drii 2 . The boundary-value problem to be studied is that of finding the sets rnix), 
€^, a which satisfy the conditions (91 -3). In accordance with established 
custom, a number a which belongs to a solution of the boundary-value problem 
is called a characteristic number^ and a set e,, belonging to such a solu- 

tion is called a characteristic seL 

In terms of accessory canonical variables defined, as usual, by the equations 
(81*3) and with the function ^ defined in the equation just preceding (81*5), 
the conditions (91*3) take the form 


(91*4) 


drji _ ^ d^i _ ^ 

dx~ 


The end-conditions, written out more explicitly, have the form 

- f <i + 7« 0, 

(91-S) + » 

^. = 0 , 

where the subscripts il and i 2 on 7 and indicate partial derivatives with re- 
spect to 17,1 and tia, respectively. The quantities involved in equations (91*5) 
i;{i, i"a, ria, f„. If two such sets, the second indicated by strokes, satisfy 
these equations, then 

(91*6) — ijii fiid" 27 (>j; ij) s* 0 , 

where the third term on the left is a symbol for the bilinear form associated 
with the quadratic form 27 . If the two sets are the same, we have 

(91-7) [i»<M! + 27 = 0. 

A fundamental property of characteristic numbers and sets is given in the 
following lemma: 
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Lembca 91 • 1. //i;i(x), X^(x), e„, or, or in canonical coordinates i;i(x), f i(x), cr, 

is a solution of the boundary-value problem above for the second variation^ then 
(91-8) /2(i7,cr) = 0, 

where a) is defined to be the sum in {91*2), 

To prove this we may replace 2(a by 

2a - rii Qf,i+ 

in the expression (91 * 2) and apply the usual integration by parts. With the 
help of the first equations (91*3) and equation (91*7) it is seen, then, that 
a) vanishes. 

Lemma 91 *2, If a is sufficiently large and negative^ the sum J2(i7, a) is positive 
definite on the class of admissible sets ijifx) satisfying the conditions = 0 

in {91*1), 

In the proof of this lemma we use the fact that a quadratic form aik rn rit in n 
variables has a minimum m on the set of points (171, • . -jVn) for which 17* 17* = 1. 
Consequently, Oik rn rik + era 17* is positive definite if c is sufficiently large and 
positive. 

Consider, then, the integral 
(91*9) f *(2Q — <ri7<i7<; dap 

as a function of the sets rn specified in the lemma. From (81 * 7) and the equation 
just preceding (81*5) we have, in terms of canonical accessory variables, 

212 (x, 17, n, A) = 2f< 2^ 

~ ^riffk ViVk"i" ^BiMkiiik • 

When we set (x, 17, f ) = (x, 0, f ) in this equation, it follows from the condition 
III' presupposed above that 212 is non-negative and hence that the same is true 
of the last quadratic form on the right. With the help of the remark about 
quadratic forms made in the last paragraph, it follows that if 0^ is sufficiently 
large and negative the integral (91*9) is positive definite. 

The same remark about quadratic forms justifies the statement that there is 
a negative constant c so large that 

or) > c{fiii flii + Vit Vii) + / *(212— in7<i7<) dx 

where h(x) is a fimction having a continuous derivative on xiXt and having 
the end-values k(xi) •> — 1, h(xi) ■■ 1. The introduction of the function h is 
a device due to Morse [23, p. 534]. The integral on the right in the last inequal- 
i^ has the form of (91 - 9) and hence is poritive definite if v is sufficiently large 
and negative. It follows at once that /s(q, v) is also positive definite for sudh 
values V. 
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Lemica 91 -3. The least upper bound ai of the values a for which J2(i7, o) is 
positive definite is the smallest characteristic number of the boundary-value problem 
{91'3)or{914). 

One can see, without difficulty, that the sum fry, o^i) must be non-negative 
when <ri is the least upper bound described in the lemma. Otherwise there would 
be a set iy< making /2(^, < 0, and this set would also make a) <0 for 

values a < oi and near oi. The sum /2(»y, ai) cannot be positive definite, since 
the criterion IVJ for positive definiteness of (t), in Section 90, holds for a 
slightly greater than ai when it holds for <ri. Hence there must be an admissible 
set rii satisfying the conditions = 0 and giving /2(iy, ai) the value zero. 

This set rn is, then, a minimizing set for /2(7y, ai) and, consequently, satisfies the 
conditions (91 • 3) with the value <ri. Thus oi is a characteristic number. For every 
other characteristic number 02 the corresponding characteristic set rn satisfies 
the conditions = Oand is such that /2(»7, <^ 2 ) = 0. Hence 72(17, <^2) is not 

positive definite, and 02 is necessarily greater than cri. 

The preceding lemmas now justify, without difficulty, the following theorem: 
Theorem 91 * 1. i4 necessary and sufficient condition that the second variation 
12(17, 0) from formula {91 2) be non-negative on the class of admissible sets rji{x) 
satisfying the conditions = 0 in {91 • 1) is that the smallest characteristic 

number oi of the boundary-value problem {9 13) of the second variation be non- 
negative. A necessary and suffiicient condition for J2(i7, 0) to be positive definite is 
<ri > 0. These conditions equivalent to IV may be designated by IV3. 

In the preceding paragraphs the case was excluded when the second variation 
J2 and the functions depend upon the constants (1 and $2- Results for this 
case, analogous to the preceding ones, can, however, be proved without diffi- 
culty. We define J2 by the equation 

72 (f, 17, <^)= 27 [fi, 7i{xi)y fa, 17(^2)] 

+f *[ 2 w--a(T 7 <T 7 i+fI-t-fJ)] dx 

•'*1 

and regard it as a function on the class of admissible sets fi, f2, i7i(^) satisfying 
conditions in the usual forms 

(91.11V ^^(«, 17, 17') = 0 , vixi ), fa, 

' ^ oPi = const. , X 2 = const. 

To reduce the problem of minimizing 72(f , 17, a) in the class of sets just defined 
to one of the type studied above, we re-write J 2 and the conditions (91 • 11) in 
the forms 

72 * 27[l7n+l(«l), Vi(Xi), 17n+2(«2), ViiXi)] 

“H *[2«— O' (17<17< + 17^+1 + 17^+2) 1 > 

17, 17') “ 17»+1 « 17n+2 * 0 

(9M2) ^|i[l7n+l(«l), 17<(»l), 17n+l(«i), 17<(«2)1-0, 

Xi B const. , Xt » const. 
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The minimum problem is, then, that of finding, in the class of admissible sets 
mC*) (j 1, ...,» + 2) satisfying the conditions (91 ‘12), one which mini- 
mizes Ja. This is a problem of the type specified by (91 ■ 1) and (91 ■ 2). It has as- 
sociated with it a sequence of three lemmas and a theorem which are quite analo- 
gous to those stated above and which give a criterion for the non-negativeness or 
pontive definiteness of the second variation /j({, q, 0 ) in its usual form. 

92. A sufficiency theorem applicable to some important abnormal cases. In 
his paper [38] on sufficient conditions for the problem of Bolza a theorem [p. 815, 
Theor. 9-2] analogous to the following one was proved by Hestenes: 

Theobeu 92 ‘ 1. Let E be an arc without comers belonging to the class HI which, 
by definition, consists of all admissible arcs satisfying the conditions 0 a = 0 * 
If E has a set of multipliers of the form lo = 1, la(x) with which it satisfies the 
conditions I, IIs, 111', IV', then it is a non singutar extremal, and there exists in 
(x, y)-space a neighborhood F of the arc E sttch that J(C) > J(E) for every arc C 
of the doss lit in F not identical with E and having its end-points sufficiently near 
to those of E. 

This statement differs from that of the basic sufficiency theorem. Theorem 
82 ‘ 1, only in the absence of the assumption that the arc E is normal. The proof 
of Theorem 92*1 given here is effected by making it depend upon the normal 
case of Theorem 82 • 1.^ 

If the order of abnormality of E is q, there must be q linearly independent ab- 
normal sets of multipliers and constants == 0 , lo,{x), ep,(o — 1, . . .,q) with 
which E satisfies the multiplier rule of Theorem 74 - 1. The equation 

(921) + 

with F, => is for each o an identity in the variables (i, it, rni, va, since this 
is what the identity in (74-9) becomes for the multipliers for — 0, l0,(x), e^m 
when the end-values of dx, dyt are replaced by those of (, ( + rii. The usual 

integration by parts applied to the int^;ral of the sum 

le,<be = Fr,j iji +Fr»; Vi 

gives the equation 

(92-2) f\9pdx~lF^’,ni]\, 

*1 

so that for every admissible set of variations satisfying the equations 0 we 
find, with the help of equations (92-1) and (92-2), 

(92'3) [Frjij ij<] 1 “* 0 , Cuf'tit 0 • 

The matrix of q sets of values (e^ ■■ 1, . . . , 9 ) is necessarily of rank q. 
Otherwise there would be a linear combination of Aese sets vanishing identi- 
cally; and the same linear combination of the sets for, «»• would then also 

> See bum (SO, pp.:373-74. 
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vanish identically, according to the remark in the paragraph preceding Corol- 
lary 74*1. This is, however, impossible, since these sets are, by hypothesis, 
linearly independent. 

In the following paragraphs the variable p is understood to have as its range 
a subset of q of the numbers p = 1, . . . , ^ such that the determinant \e^\ is 
different from zero, and the variable v will have the range complementary to 
that of p. The second equation (92 *3) then shows that for an admissible set fi, 
fe, mix) the equations == 0 are consequences of the equations 4>/j = = 0. 

Theorem 92*2. Let E be an arc satisfying the hypotheses of Theorem 92- 1, 
and let p and v be variables whose ranges are the subsets of the numbers p » 1, . . . , p 
determined in the last paragraph. Then the arc E is normal for the modified problem 
of minimizing the functional J(C) + ep ^p(C) in the class of admissible arcs C 
satisfying the reduced system of equations — ypv = 0, and the arc E with the 
multipliers /o = 1, /p(x), e^ satisfies the hypotheses of Theorem 92' 1 for the modi- 
fied problem. Furthermore^ if J(E) + ep ^p (E) is a strong relative minimum for 
the modified problem^ then J(E) is a similar minimum for the original problem. 

It is easy to see that the arc E is normal for the modified problem. For if E 
had for that problem a set of non-vanishing multipliers of the form /o = 0, 
e,, the set U = 0, lfi(x)^ ep = 0, would be multipliers for E and the original 
problem, necessarily linearly expressible in terms of the q sets loa = 0, lfii,{x), e^, 
(<r = 1, . , . , This is, however, impossible on account of the fact that the de- 
terminant I^ppI is not zero. 

The arc E satisfies the hypotheses of Theorem 92 • 1 for the modified problem 
with the multipliers k = 0, /p(x), as may be seen by an examination of the 
conditions I, Iljv, III', IV'. For the condition IV' one needs to note that, on 
account of the second equation (92 *3), the restricted system of equations of 
variation = 0 implies the complete original system = 0. 

Since the class of arcs in which a minimizing one is sought for the modified 
problem includes as a subclass those among which a minimizing arc is sought for 
the original problem, and since on the subclass the values of the functionals 
/(C) + ep ypp{C) and /(C) are equal, the last statement of the theorem is evi- 
dently true. 

Thus we see that the proof of the sufficiency theorem for the abnormal arcs 
studied by Hestenes can be reduced to a proof of the minimizing property for an 
arc which is normal. The abnormal cases covered in Theorem 92 • 1 are the most 
interesting ones for which sufficient conditions have so far been established. 
Further interesting results concerning abnormality are found in the papers 59, 
60, and 61 by McShane. 




APPENDIX 




APPENDIX 


EXISTENCE THEOREMS FOR IMPLICIT FUNCTIONS 
AND DIFFERENTIAL EQUATIONS 

The development of the theory of the calculus of variations depends intimate- 
ly upon existence theorems for implicit functions and differential equations. 
References to such theorems are frequent and of fundamental importance in the 
theory. As a result of their use in the calculus of variations the theorems them- 
selves have undergone extensions and improvements which have not always 
found their way into the textbooks on the theory of functions of a real variable. 
Notable among such extensions are the theorems of Section 2, below, concerning 
implicit functions, and the imbedding theorem for differential equations in Sec- 
tion S. On account of their great usefulness these existence theorems are pre- 
sented here in a form which it is hoped will be especially convenient for students 
of the calculus of variations. The presentation is based upon mimeographed 
notes which the author has used in recent years in his classes at the Univer- 
sity of Chicago. 


I. EXISTENCE THEOREMS FOR IMPLICIT FUNCTIONS^ 

1. The fundamental existence theorem for implicit functions. The equa- 
tions whose solutions are to be studied have the form 


/< ( • • • I ^mi y\^ • • • > ^n) — 0 (f— 1, •••, fl) • 

When convenient, the sets {xu . . . , yi, . . • , yn), fe, ...» (yi, . . . , yO 

may be denoted more simply by (x, y), x, y, respectively; and the notations 
(a, &), a, b in the following theorems will indicate sets of similar types. By 
D(x, y) is meant the functional determinant 


D(-x, y) 


a(/i, 

d(yi, 


yn) dy* 


The neighborhood (a, is the totality of points (x, y) satisfying the inequalities 

|xr-ar| ^€, |y< — 6t| (r = 1, . . . , m; 1, . . . , ») , 

and a«, have similar meanings. With these understandings as to notations the 
fundamental theorem is now the following: 

^ See Bliss, Fundamental Existence Theorems (Princeton Colloquium of the American Math- 
ematical Society) (1909); Bolza, Vorlesungen Uber Varialionsrechnung (2d ed.; 1933), chap, iv, 
pp. 154-88. 
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Theorem 1 • 1. Ze/ p * (a, b) be a point with the following properties: 

(1) the functions fi(x, y) are continuous and have continuous derivatives dfi/dyk 
in a neighborhood N of p; 

(2) f,(a,b)«0 (i=l,...,n); 

(3) D(a, b) 5 ^ 0. 

Then there exists a set of functions yi(xi, . . . , Xm) single valued and continuous in 
a neighborhood aa and having the following properties: 

(1) The points [x, y(xi, . , . , Xm)] which they define are in N and satisfy the 
equations fi = 0; 

(2) There exists a constant e such that for each x in aa the set [x, y(xi, . . . , Xm)] 
is the only solution (x, y) of the equations fi = 0 satisfying the inequalities 

yi{xij “€<yi<y<(xi, x„,)+€; 

(3) yi(aij • • • 9 am) “ bi (i = li • • • , n), 

(4) In a sufficiently small neighborhood aa the functions yi(xi, . . . , Xm) have con- 
tinuous partial derivatives of as many orders as are possessed by the functions fi in 
the neighborhood N. 

To prove the theorem it can first be shown that in a sufficiently small neigh- 
borhood (a, 6)« the equations /< = 0 can have no pair of distinct solutions (op, y), 
(op, y') with the same projection x. By Taylor^s formula with integral form of re- 
mainder 

(M) /<(ap, y')-fi{x, y) =B<i(yi-yi) + . . . +-B.n(yn-yn) , 

where 

Bikix, y, y') = f finix, y+ diy' -y)] de . 

The determinant B(x, y, y) = \ Bn, | has the initial value B(a, b, b) = D{a, b) ^ 0, 
and hence a neighborhood (a, b), can be chosen within the neighborhood N of 
the theorem and so small that in it the determinant B(x, y, y') remains different 
from zero. The expression (1-1) can then vanish for a pair of points (*, y), (*, •/) 
in (a, b), only if y and y are identical. For the purposes of later paragraphs it 
should be noted that D{x, y) = B{x, y, y) is necessarily different from zero in 
the neighborhood (a, b), where B{x, y, y') does not vanish. 

There exists a neighborhood at in which every x belongs to a solution (x, y) 
in (a, b),. To prove this, one may make use of the auxiliary function 

«»(*, y) = /!+... +/i. 

On the closed set of points (x, y) a (a, tj) for which ii ranges on the boundary 
b, the inequality 


vio, n) “<p(o, b)>0 
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is satisfied, since by the results of the preceding paragraph ^(a, y) vanishes only 
at y = 5 in the neighborhood &«. Hence the inequality 

ip{x, 17) i) > 0 

holds true when x lies in a sufficiently small neighborhood and 77 has the same 
range as before. This implies, however, that for each fixed x in at the continuous 
function y) has its minimum at a point y interior to &«, and not at a point 
y = )/ on the boundary of 4,. At such a minimizing interior point for ^ the de- 
rivatives 

2 dyi ^^dyr ^^dyi 

must vanish; and since Z>(x, y) 5*^ 0 in (a, 4)„ it follows that the functions /, all 
vanish. To each x in at there corresponds, therefore, a solution (x, y) of the 
equations /i = 0 in (a, 4)«. 

The single-valued functions ytte , . . . ^x^) thus defined in the neighborhood 
at are continuous. For if their values at x and x+ Ax are denoted, respectively, 
by y and y+ Ay the equations 

0 = /i(« + Ax, y + Ay) = /<(x-f Ax, y + Ay) 

fii^ + AXy y)+/<(x + Ax, y) 

— ^^BikAyk+ fi{x+Ax, y), 

k 

where the arguments of B,* are (x, y, y') = {x + Ax, y, y + Ay), show that the 
increments Ay* approach zero with Ax. The conclusions of the theorem, with the 
exception of (2) and (4), are now evident. 

Conclusion (2) can be readily established. The set of points [x, y(xi, . . . , Xm)l 
defined by the functions y<(xi, . . . , Xm) on the neighborhood at is in (a, 6)«, and 
hence the determinant D{x, y) = B(x, y, y) is different from zero on it. The de- 
terminant B(x, y, y') will remain different from zero for every pair (x, y), (x, y') 
in a sufficiently small neighborhood of the type defined by the inequalities in 
conclusion (2) of the theorem. In such a neighborhood the expressions (1 * 1) 
show, as before, that there can be no pair of distinct solutions (x, y), (x, y'). 

To prove conclusion (4) let Ax2 = . . . = AXm = 0 in equations (1*2). These 
equations can then be written in the form 



where 

Ci^ f /<,,(xi + 4Axx, X*, ..., x„„ v)de. 

•'0 

It is evident that as Axi approaches zero the quotients Ay*/Axi approach defi- 
nite limits, and the existence of the derivatives of the functions y»fxi, . . . , Xm) 
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with respect to the variables can be similarly proved. Furthermore,' 

the derivatives with respect to Xr satisfy the equations 


dfi dyt ■ _ Q 

dyudXr dXr 


(» = 1 ») , 


and these equations can be solved for dyt/dxr {k ^ The expressions 

so found are continuous in «i, . . . , Since the functions yi{xi , have 
continuous first derivatives, it follows from the same expressions that when the 
functions fi have continuous second derivatives the functions yi(xi, Xm) 
will have the same property. By an easy induction one obtains then the conclu- 
sion (4) of the theorem. 

2. An extension of the theorem of the preceding section. The theorem which 
has just been established shows that, if a single initial solution (a, i) of a system 
of equations /<(», y) = 0 is known, there will, in general, be a continuous set of 
solutions [x, y(xi , ...,««)] containing the initial one. In the following paragraphs 
it is to be proved that every continuous set of solutions [*, y(*i, . . . , *»)] with 
suitable properties is itself a part of a larger set. If ^ is a set of points x, the nota- 
tion Ai indicates the set of points x each of which satisfies the inequalities 
1», — a,| ^ S (r — 1, m) with at least one point aotA. 

Theorem 2- i. Let 'P be a set of points [x, y(xi, . . . , Xm)] defined by a set of 
functions ytCxi, . . . , Xm) which are single valued and continuous in a bounded 
dosed region A of x-space and with the further properties: 

(1) The functions Uix, y) are continuous and have continuous derivatives in a 
neighborhood N of P; 

(2) fi(x, y) = 0 m P (i = 1, . . . , n); 

(3) D(x, y) 5 ^ 0 m P. 

Then there exists a set of fundions Yi(xi, . . . , Xm) single valued and continuous in 
a neighborhood At and having the properties: 

(1) The points [x, Y(xi, . . . , Xm)j which they define are in N and satisfy the 
equations L =» 0; 

(2) There exists a constant c suds that for each x in At the set [x, Y(xi, . . . , Xm)] 
is the only solution (x, y) of the equalities ft = 0 satisfying the inequalities 

YdXu .... *«,) -e<y<< Ydxi, ...,*«)+€ (** 1, . . . , »); 

(3) Yifxi, . . . , Xm) = yifxi, . . . , Xm) t» Aj 

(4) In a st^cietUly small neighborhood At the functions Yi(xi , , Xm) have 
continuous partial derivatives of as many orders as are possessed by the fundions L 
in the neighborhood N. 

To prove this, one should note first that the set of points P is bounded and 
closed because A is bounded and closed and that the functions yi(«) are con- 
tinuous in ^1. 
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There exists a neighborhood /*• in which no two distinct solutions (x, y) of 
the equations /i = 0 have the same projection *. Otherwise, for every positive 
integer k there would be a set (*, y, y% for which (*, y)* and (», y')* are distinct 
solutions of the equations /< » 0 in the neighborhood Pi/k. The sequence of 
points (x, y, y'jt would be bounded and hence would have an accumulation 
point ({, 71, ijO- The point ({, ij) would necessarily be in P. Otherwise, since P 
is closed, there would be neighborhoods ((, ti),, P, having no pomt in common. 
The former would contain an infinity of the points (x, y)k, and therefore surely 
some of those for which 1/k < c and which are in P,. But this is impossible. 
Similarly, ((, n') would be in P, and ({, ri') would coincide with ({, ij), since no 
two distinct points in P have the same projection In every neighborhood of 
((, ij) there would then be an infinity of the distinct solutions (*, y)*, (#, y')k of 
the equations /< =» 0, which is impossible, according to the first paragraph of the 
proof of the preceding theorem. The existence of a neighborhood P, containing 
no two distinct solutions (x, y), (x, yO of the equations /< « 0 is therefore 
proved. 

There exists a neighborhood Ak with 3 ^ c in which every x is the projec- 
tion of a solution (x, y) of the equations /< = 0 in P,. Otherwise, for eadi posi- 
tive integer k there would be a point (x)k in the neighborhood Ai/k which would 
not be the projection of any solution of the equations fi =» 0 in P«. The sequence 
(«)« would be bounded and would have an accumulation point ( which, by an 
argument similar to that of the last paragraph, would be in ^4. It would there- 
fore belong to a point (|, i;) of P. According to the proof of the fundamental 
theorem of the preceding section, however, there would then be a neighborhood 
(k in which every point * would belong to a solution (x, y) of the equations fi==^0 
in the neighborhood ({, i;).. Thus the assumption made above would be contra- 
dicted; and it is evident that there must be a neighborhood in which every 
point X belongs to a solution of the equations /< = 0 in P„ which may be de- 
noted by [*, Y(xi, , #»)]. 

The single-valued functions Yi(xi, ...,*«) defining the solutions of the equa- 
tions /{ = 0 in P« for the points a; in A* are continuous and have continuous de- 
rivatives of as many orders as are possessed by the functions/,- in the ndghbor- 
hood N. This follows from conclusion (4) of the theorem of the preceding sec- 
tion applied at the point (a, b) = [*, Y(xi *«)], at which all the h 3 q)otheses 

of that theorem are satisfied, provided only that the neighborhood P« is taken 
so small that the determinant D(x, y) is everywhere different from zero in it. 
Thus conclusions (1) and (4) of the theorem of this section are proved. Conclu- 
sion (3) is an immediate consequence of the fact that in the neighborhood P. no 
two distinct solutions of the equations fi^ 0 can have the same projection x. 
Conclusion (2) follows from the second paragraph of the proof of the theorem 
applied to the set of points [x, Y{xi , ..., «.,)] corresponding to the points x 
in Ak. 
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n. EXISTENCE THEOREMS FOR DIFFERENTIAL EQUATIONS* 

In the following paragraphs the existence and imbedding theorems for dif- 
ferential equations which are useful in the calculus of variations will be deduced. 
The notations of the preceding pages on implicit functions will again be useful^ 
with the additional agreement that the modulus of y is defined by the equation 

mod y =* (yi+ . . • • 

3 . The existence of a solution through an initial point. The system of equa- 
tions to be considered has the form 

( 31 ) ^ = /a*, yi, •••, y«) (t=l, 

and the initial point through which a solution is sought is ((,17) » (i, 171, , ijn) . 
The functions /»•(«, y) are supposed to have the following properties: 

(a) They are single valued and continuous in a neighborhood ((> i7)<; 

(i) There exists a constant i such that for every pair of points (x, y) and 

(x,r)m((,v). 

!/•(«, y) F)|<^mod(y-F) (i=l, ...,n). 

The factor l/n^^* is introduced for convenience. The maximum of the abso- 
lute values \fi\ in the neighborhood ({, 17)4 will be denoted by M. By a solution 
of the equations (3 • 1 ) is meant always a set of functions yi(x) which, with their 
derivatives yi(x), are continuous on an interval xix^ and satisfy the equations. 
Such a solution is said to be interior to ((, if), if all the points [x, y{x)] which it 
defines on ^1X2 are in this neighborhood. The first theorem to be proved is then 
the following: 

Theorem Z l, If the functions fi(x, y) have the properties described above in a 
neighborhood ((, 17)*, then there exists in ((, 17). one and but one solution yi(x) of 
equations {3 • I) through the initial point ({, 97) . The solution y i(x) is defined and in- 
terior to ((, i7)« at least on the interval |x — ^ 1, where \isthe smaller of c and 

c/M. 

To prove the theorem consider the sequence of functions starting with y lo » rti 
and defined successively by the equations 

y<i = Vi /<(*, yo)rf*, 
y«-fli+^*/<(*, yOd*. 


y<.«+i - Vi fiix, ym)dx, 


where in the integrands only the second subscripts of the y<M are indicated. 

* For earlier presentations and further references see Bliss, op. cit., and **The Solutions of 
Differential Equations of the First Order as Functions of Their Initial Values,** Annals of 
Matkematks, 2d ser., VI (1905), 4CM»; also Bolza, op. cit. 
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Hie functions of thtf sequence are all well de^ed on the interval | « — ( | g /, 
since the set ya evidently has this property; and since when the set defines 
points (», ym) in (f, i;)., then also 

|yi.«+i->?il ® 1^ ym)dx^^M\x- i \ <t. 

Consequently, the points {x, y^+i) are also in 17 ), and can be used to define 
the next functions of the sequence. 

It can now be shown that the series 

(3-3) y<i + (yij - y.-i) + . . . + (y<m - y<.m-i) + . . . 

convei:ge uniformly on the interval | « — £ | ^ f, so that the same is true of the 
sequence of functions yim- If F is a constant such that |y<i — y<o| ^ -F on the 
interval |x — $| ^ f, then one can easily see, by an inductive proof with the 
help of the property (5), that 

|y<j-y<i| ^ | 7 ^^mod(y,-yo) d«| 


|y<.m+i-y<ml 


ml 




ikl)^ 

ml 


Hence the terms of the series (3 - 3) after the first are dominated by the terms of 
the convergent series of constants P(e*^ — 1). Since the terms of the series 
(3*3) are all continuous functions of x on the interval ~ ^ I, the same is 

true of its limit yi(x). 

The limit functions yi(x) satisfy the integral equations 

(3-4) yi{x)=ni+f filx, y(.x)]dx, 

•'t 

since the two members of this equation are the limits of the corresponding se- 
quences' in equations (3*2), It follows at once, by differentiating the last equa- 
tion, that the system yi(x) is a solution of (3 *1). 

There cannot be two distinct solutions y»(x), Zi(x) through the initial point 
((, ri). Such a pair of solutions would, in fact, satisfy the relations 


yi 


Vi 


fi(x, y)dx , 


Zi = tii fi(.x, z)dx, 

\yi- *<1^1/ (y- 8)dac| . 
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If Q is the maximum of the expressions — s<(«) | on |« — {| ^ the last 
inequality shows successively that 

\yi-zi\ mk\x-k\ 


m! f«! • 

Since this expression approaches zero as m increases indefinitely, it follows that 
yi{x) s Zi{x), and the two solutions must be identical. 


4. Existence theorem for linear equations. For a system of equations linear 
in the variables y,-, of the form 


(4.1, 


^1) • • • I ^«) • • • I ^*) 

(i=l, 


«), 


the methods of the preceding section give further results of importance. It is 
presupposed that the functions Aikix^ 6), Bi(Xy b) are continuous for all sets 
(«, b) satisfying the conditions 

^ ( ii, . . . , &•) in a region B , 


where is a bounded closed region in the space of points (6i, Then one 

can prove: 

Theorem 4 1. Through each initial point (f, ??) with xi ^ ^ X 2 , and for each 

set b in B, equations (4 * 1) have one and but one solution. The functions y i(x, f b) 
defining this solution and their derivatives y ix are continuous for all sets (x, rj, b) 
satisfying the conditions 

(4-2) ^ 1 ^^, (inB. 

If in the sequence (3 • 2) the functions /* are the second members of equations 
(4-1), it is easy to see that the functions yimix, (, b) of the sequence are de- 
fined and continuous for all sets (x, q, b) satisfying the conditions 
(4-3) Xi^x^X2, Xx^i^X2, |77 i|^C, binB, 
where C is an arbitrary constant. The constants k, P of the convergence proof 
can be selected so that they are effective simultaneously for all x, q, b satisfy- 
ing the conditions (4-3). Hence the sequence yimix, ly, b) converges uniformly 
for all sets (x^ f, q, b) satisfying (4*3), and the limit functions y<(x, f, q, b) are 
continuous for all sets (x, q, b) satisfying (4 *2), since the constant C is arbi- 
trary. The equations (4*1), satisfied by these functions, show that the deriva- 
tives yi* have the same continuity property. The proof of the uniqueness of the 
solution when 17, b are fixed is identical with the uniqueness proof of the last 
section. 


5. The imbedding theorem. In Section 3, above, it was shown that through 
each initial point ((, q) there exists one and but one solution of the differential 
equations (3 * 1) defined on a properly chosen interval | x — £ | ^ /. In the present 
section it is proposed to prove that a solution £, already known, is always a 
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member of a family of such solutions determined by neighboring initial points. 
The theorem to be established is the following one: 

Theorem 5 - 1. The Imbedding Theorem. If in a neighborhood R of the points 
(x, y) on an arc £ having equations of the form 

(SI) yi^yi{x) {xx^x^x^) 

the functions fi(x, y) have the continuity properties (a) and (b) of Section J, and if E 
is a solution of the differential equations (J * i), then through every point (f, tf) suffi- 
ciently near to £ there passes one and hut one solution 

yi^y^ix, ii) 

of the differential equations. The functions y,(x, f, yf) their derivatives yix(x, f , t;) 
are continuous and satisfy the differential equations {S' 1) identically for all sets 
(x, t, 7?) in a neighborhood N of those belonging to E. 

In order to prove the theorem it should first be noted that the curve (5 * 1) 
satisfies the equations 

(S-2) y.{x) ^yAi)+ rf^[x, y{x)]dx, 

since it is a solution of equations (3 1). The notation E. will be used to denote 
the totality of points (jr, y) satisfying the conditions 

Xi^x^xi, |y,-yi(*) I ; 

and Ei will represent a similar neighborhood for which 

b , / = X2 — . 

It is always understood that the constant € is taken so small that the whole of 
is contained in the neighborhood RoiE where the functions /» have the continu- 
ity properties of the theorem. 

The proof is made by means of the sequence of approximation functions 
yim{x, (, rj) defined by the equations (3 *2), in which, however, 

(5-3) ytoCxy f, v) == Vi + yi(x) -yt(() . 

The elements yim(x, tj) of the sequence are well-defined and continuous for all 

sets (x, f , Tf) satisfying the conditions 

(5*4) Xx^x^Xt, ((, rj) in Es, 

since with the help of equations (5 *3), (3 *2), (5 -2), and the property (b) it fol- 
lows successively that 

|y<o-y<(*) I = I ^ , 

ly<i-yi(*) I ^ I »7{-y<(€) {/<(*, yo)-/il*i y(*)] 


|yi»-y<(*) I < «| 14 


11 


+ . 






m\ 
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and the last expression is less than 8e^‘, which is, in turn, less than c. Hence, 
every approximation curve lies in the neighborhood JS. and can be used to de- 
fine the succeeding one. 

The convergence proof for the sequence yim(x, (, n) is identical with that of 
Section 3. It should be noticed, however, that a constant P exists such that 

f, i>) 1 

for all sets (x, (, ti) satisfying the conditions (5-4), on account of the continuity 
of ya and y^o- Hence the convergence of the sequence ytmCx, v) is uniform in 
the rq;ion of points (x, |, q) defined by (5-4), and the limit-function yi(x, (, q) 
is also continuous in that region. 

The proof that yi(x, {, q) satisfies equations (3 - 1) and is the only solution of 
these equations through the initial point ((, q) is the same as that of Section 3. 
The continuity of yix(x, (, q) in the region defined by (5-4) follows from equa- 
tions (3-1) and the continuity of y,(*, f, q). 

If the existence theorem of Section 3 is applied at the end-points of E, an ex- 
tended solution E' is found on an interval xix^ containing xixt in its interior. 
The arguments of the preceding paragraphs applied to E' justify the continuity 
properties of yi(x, {, q) and yix(x, |, q), not only for points (x, q) with Xi^ x^ 
Xa, but also for a neighborhood N of the kind described in Theorem 5 - 1. 

6. Derivatives with respect to constants of integration. In order to describe 
the circumstances under which the solution yi(x, (, q) of the last section has 
partial derivatives with respect to the constants q, let us consider first a family 
of solutions 

(6-1) y.-=Fi(*, i) 

of equations (3-1), including the original solution E for the parameter values 
(6-2) Xi^x^Xj, ft* “5*® (A»l f) 

and having fimctions y<(«, ft) and their derivatives yis(x, ft) continuous in a neigh- 
borhood of the values (x, ft) satisfying the conditions (6-2). Then we can prove: 

Theobeu 6-1. If ike functions fi(x, y) are continuous and have continuous 
partial derivatives of orders ^ q v/i^ respect to the variables yin a neighborhood R 
of the points (x, y) on the arc E, and if at one particular value x» the functions 
Yi({, b) have their partial derivatives of orders ^ q with respect to the variables b 
continuous near b » b®, then the solutions Yi(x, b) of equations (3-i) and (heir 
derivatives Yu(x, b) have continuous partial derivatives of orders ^ q voUh respect 
to the variables b at every point (x, b) in a neighborhood of the sets (x, b) belonging 
to E specified in (6-2). 

If the functions fi have continuous partial derivatives of orders ^ q with respect 
to all of the variables x, y in the neighborhood R, then the functions Yi(x, b) and 
Yix(x, b) have continuous partial derivatives of orders ^ q with respect to all of (he 
variables x,b in a neig/tborhood of (he values (x, b) bdongfng to E specified in 
( 6 - 2 ). 
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The theorem will be proved for the case when there is only one parameter b. 
The extension for several parameters is simple. 

To prove the existence of the first derivatives dYi/db for values x ^ it 
should first be noted that for values ft + Ai near ft** the quotients 
_ F<(*, ft+Aft)-F,(*, ft) AF, 

Aft Aft- 

are solutions of the linear equations 

(6-3) ft, Aft)s*, 

where ^ 

A« = / /«(*, F+ftAF)dft. 


The notation /<* is used for the derivative dfi/dyk. The functions Aik are con- 
tinuous in a neighborhood of the values (x^ b, Ab) satisfying the conditions 
Xi^x^Xiy 6 = 6®, A6 = 0. 

Hence, according to Theorem 4 1, there exists through the initial point (f, f) 
one and but one solution Zi(x, f , 6, A6) (t = 1, . . . , n) of the linear equations 
(6 ‘3). For this solution the functions Zi and their derivatives Zi» are continuous 
at sets (jc, {, 6, Ab) with f arbitrarily chosen and with values (x, 6, Ab) in a 

neighborhood of those defined by the conditions 

Xi^Xy {^^2, 6=6®, A6 = 0. 


The solutions Zi = AF,/A6 of equations (6 -3) are expressible in the form 


AYi 

Ab 




where the notation (AY/Ab)^ stands for the set of initial values of the quotients 
AF«/A6 at ^ = (. If these quotients approach limits as A6 approaches zero, as 
supposed in the theorem, it follows from the last equation that the same is true 
for AF/A6 at every value of x. Thus it is clear that the derivatives dYi/db exist, 
have the values 


dYi 

db 


[*• »]• 


and are continuous for all sets (oc, 6) in a neighborhood of those specified by the 
conditions (6*2). 

The preceding paragraph is a proof of the first part of Theorem 6 • 1 for g = 1 . 
We now assume the theorem true for integers S q and prove that it holds for 
g + 1. We start with the assumptions that the functions /< have continuous de- 
rivatives of orders ^ g + 1 in the variables y and that at a fixed value « = { 
the family of solutions (6 *1) has continuous partial derivatives of orders ^ g+ 1 
with respect to 6. The equations 


(6-4) 
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are of the type (3 • 1 ) in the variables i, «. They have a family of solutions 
(6-5) 6 , ziix, b)^Yi{x, b) 

analogous to (6*1) for equations (3*1). It contains an initial solution analogous 
to E for the values (6 *2). The functions s»({, b) have continuous derivatives of 
orders ^ q with respect to 6 , since F<({, b) is now assumed to have derivatives of 
orders ^ q + 1. The second members of equations (6 *4) have continuous de- 
rivatives of orders ^ q with respect to b and Zk in a neighborhood of the values 
X, b, z belonging to the initial solution of (6 *4) defined by the values (6*2). 
Hence the family (6*5) has continuous derivatives of orders ^ q with respect 
to i in a neighborhood of the values (x, b) specified by the conditions ( 6 - 2 ), since 
the theorem is now supposed to hold for the integer q. The functions Ft(x, b) 
have, therefore, continuous derivatives of orders ^ g + 1 with respect to b in 
the same neighborhood, which was to be proved. That the same is true of the 
functions F<*(x, b) is seen by substituting F<(x, b) in equations (3*1). 

The second paragraph of Theorem 6 * 1 can easily be justified for ^ = 1 by 
establishing first, as above, the existence of continuous derivatives F»*(x, b) 
and Frt(x, A). Then it follows from equations (3 *1) with F»(x, b) substituted 
that Yis has continuous first derivatives. 

The induction needed for the second part of the theorem is much like the one 
above. We start with the assumptions that the functions /< have continuous de- 
rivatives of orders ^ ^ + 1 with respect to all of the variables x, y and that 
yi((, b) has continuous derivatives of orders ^ ^ + 1 with respect to A. The sec- 
ond members of equations (6 *4) now have continuous derivatives of orders ^ q 
with respect to all of the variables x. A, zt; and their solutions (6 *5) have, there- 
fore, continuous derivatives of orders ^ q with respect to all of the variables x, A 
in a neighborhood of the values (x. A) specified by (6 * 2). It follows readily, by an 
argument like that of the next to the last paragraph, that Fi and Yix have de- 
rivatives of orders ^ ^ + 1 in all of the variables x, A. Thus Theorem 6 * 1 is 
completely proved. 

If the family (6 * 1) contains n parameters bk{k — 1, . . . , n) and is such that 
the functional determinant \dYi/dbk\ is everywhere different from zero, then 
the equations 

( 6 * 6 ) F<(«, A) 

will, in general, have solutions bkHj fi) which define a solution 

Yi[x, A(f, v)] 

of the differential equations (3*1) through the point ((, 17 ). On account of the 
uniqueness of the solution through ({, 17 ) we must then have 


(6*7) 


Vil*, >»)] f i»)i 
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where the functions on the right are the ones whose existence was stated in 
Theorem 5 * 1. We shall use this remark to advantage in the proof of the following 
theorem: 

Theorem 6 *2. If the functions fi(x, y) are continuous and have continuous 
partial derivatives of orders ^ q with respect to the variables yin a neighborhood R 
of the points (x, y) on the arc E, then there is a neighborhood N of the points (x, (, rf) 
on E in which the functions y i(x, f , n) of Theorem 5 • 1 and their derivatives yix{x, 
ri) have continuous partial derivatives of orders ^ q with respect to the parameters 
rikfor each fixed 

If the functions fi have continuous partial derivatives of orders ^ q with respec 
to all the variables x, y in a neighborhood R, then there is a neighborhood N in 
which the functions yi and their derivatives yiz have continuous partial derivatives 
of orders g q with respect to all of the variables x, f , ri. 

There is always a family (6 *1) containing E and having n parameters b such 
that the functional determinant \dYi/dbk\ is everywhere different from zero. 
Such a family is the one defined by yt(x, ri) for a fixed value The functional 
determinant | dyi/drik \ has the value unity at f and is different from zero 
everywhere, since its columns form a fundamental system of solutions of the 
linear equations 

= /«[*. y(*, »»)] z*- 

The proof of this well-known property is like that of Section 12 of Part I above 
for the case when there are four variables z. 

The equations (6 *6) have initial solutions $, b defined by the conditions 
(f, 1 ?) on the arc£ , 6=6®. 

Hence, according to Theorem 2 • 1, they have a solution 6,Cf, tj) defined and con- 
tinuous in a neighborhood of the values ((, y) belonging to the arc E and having 
the values 6? whenever ((, rj) is on E, As a consequence of the hypotheses of the 
first part of Theorem 6*2, the functions 6<({, y) have continuous derivatives of 
orders ^ q with respect to the variables yk when f is fixed. According to the hy- 
potheses of the second part of the theorem, the functions 6i(f, y) have continu- 
ous derivatives of orders ^ q with respect to all of the variables f, y. On account 
of the identity (6 *7) these properties justify the conclusions of the theorem. 

It is perhaps interesting to note that, as a result of the hypotheses of the first 
part of the theorem, the functions f, y) have more continuous derivatives 
than are there specified. The solution bk(^, y) has, in fact, in that case, a part of 
its derivatives of orders ^ q with respect to all of the variables y continuous. 
The continuous ones are those which do not involve more than one differentia- 
tion with respect to Evidently, the same will then be true of the derivatives 
of yi(Xf y) and y»x(«, f, rj) with respect to the variables y. 

Corollary 6 *3. Suppose that the equations 

(t=l, n) 


( 6 . 8 ) 
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m2 

have a particular solution E for special values bro (r » 1, . . . , s) parameters b 
and that the functions Uix, y, b) are continuous and have continuous partial deriva- 
tives with respect to the variables yk, br in a neighborhood of the values (x, y, b) he- 
longing to £. Then for parametric values br sufficiently near to the bro there exists one 
and but one solution of equations (6 * 8) through a point ((, ri) sufficiently near to £. 
The functions 

= 1 ?, h) 

defining this solution^ and their derivatives y iz, are continuous in a neighborhood N 
of the values (x, 17 , b) belonging to £. If the functions fi(x, y, b) have continuous 

derivatives of orders ^ q with respect to the variables yk, br, then for each fixed 
( » Xo the functions y i, y iz have partial derivalives of orders ^ q with respect to 17 k, 
br continuous at each (x, 17, b) such that (x, Xo, 17, b) is in N. If the functions fi(x, 
y , b) have all partial derivatives of orders ^ q continuous^ then the functions y i, y iz 
have all partial derivatives of orders ^ q with respect to the variables x, 17, b 
continuous in N. 

These results are immediate consequences of the preceding corollaries applied 
to the system of equations 

-^ = /<(*, y, ft), -5^ = 0 (» = 1, f = 1 s). 

In conclusion, attention may be called to a formula which is sometimes use- 
ful. The matrix (^, v) II reduces to the identity matrix at x = (, and its 
columns form a fundamental system of solutions of the linear equations 

(6-9) *<-/«[*, y(x, (, v)Jet. 

Every solution Zi of these equations is expressible in the form 
(610) 

where the f « are the initial values of the functions zt at x = This is evident 
since the second members of equations (6 ■ 10) are solutions of the equations (6 - 9) 
and have at x = ( the initial values ti which characterize uniquely the solu- 
tions Zi. 

On account of the identities 

ri) =rn, 

we have the equation 

yixii, i, v)+yit((, (, v)=0; 

and we see that the solution 17 ) of equations (6 *9) has at X * ( the initial 

values 


Vit((f i, i») - -yi,(i, i») - -/<({, n)- 
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Hence, by means of (6 ■ 10), the solution y<t is expressible in the form 
(611) yaix,' t, 1 ?) = -/*(€, f, ij). 


These equations show that for a fixed value x the functions yi{x, (, it) are n inde- 
pendent solutions g((, if) of the partial differential equations 




+ /*(?» v) 


IL 

diik 


= 0 . 


They also express the fact that for a fixed x, y the equations 
(6- 12) y. = yi(*, It) 

define a curve with running coordinates {, it which satisfies equations (3 ■ 1). On 
account of the uniqueness of the solution of equations (3 - 1) through the point 
({, ij) or through the point (*, y), it follows readily that equations (6 12) have 
the solutions 

ij< = y<(€, X, y), 

and conversely. This property is closely related to those above. 
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